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COMPOSITIONS AND METHODS UTILIZING DNA POLYMERASES 

FIELD OF THE INVENTION 
The present invention relates to compositions and methods utilizing DNA 
polymerase enzymes with reduced discrimination for non-conventional nucleotides. The 
10 enzymes of the invention are useful in many applications calling for the detectable 

labeling of nucleic acids and are particularly useful in DNA sequencing applications. 

BACKGROUND OF THE INVENTION 
Detectable labeling of nucleic acids is required for many applications in 
molecular biology, including applications for research as well as clinical diagnostic 

1 5 techniques. A commonly used method of labeling nucleic acids uses one or more 

unconventional nucleotides and a polymerase enzyme that catalyzes the template- 
dependent incorporation of the unconventional nucleotide(s) into the newly synthesized 
complementary strand. 

The ability of a DNA polymerase to incorporate the correct deoxynucleotide is the 

20 basis for high fidelity DNA replication in vivo. Amino acids within the active site of 

polymerases , form a specific binding pocket that favors the placement of the correct 
complementary nucleotide opposite the template nucleotide. If a mismatched nucleotide, 
ribonucleotide, or nucleotide analog fills that position, the precise alignment of the amino 
acids contacting the incoming nucleotide may be distorted into a position unfavorable for 

25 DNA polymerization. Because of this, the unconventional nucleotides or nucleotide 

analogs used to label DNA tend to be incorporated into the elongated strand less 
efficiently than do the standard deoxynucleotide triphosphates (dNTPs; the so-called 
"standard" dNTPs include deoxy adenosine triphosphate (dATP), deoxycytosine 
triphosphate (dCTP), deoxyguanosine triphosphate (dGTP), and thymidine triphosphate 

30 (TTP)). 

The reduced efficiency with which unconventional nucleotides are incorporated 
by the polymerase increases the amount of the unconventional nucleotide necessary for 
DNA labeling. The reduced efficiency of incorporation of a particular nucleotide can 
also adversely affect the performance of techniques or assays, such as DNA sequencing, 
35 that depend upon unbiased incorporation of unconventional nucleotides for homogeneous 

signal strength. 
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5 The identity and exact arrangement of the amino acids of a DNA polymerase that 

contact an incoming nucleotide triphosphate determine the nature of the nucleotides, both 
conventional and unconventional, that may be incorporated by that polymerase enzyme. 
Changes in the exact placement of the amino acids that contact the incoming nucleotide 
triphosphate at any stage of binding or chain elongation can dramatically alter the 

10 polymerase's capacity for utilization of unusual or unconventional nucleotides. 

Sometimes changes in distant amino acids can influence the incorporation of nucleotide 
analogs due to indirect global or structural effects. Polymerases with increased capacity 
to incorporate nucleotide analogs are useful for labeling DNA or RNA strands with 
nucleotides modified with signal moieties such as dyes, reactive groups or unstable 

15 isotopes. 

In addition to labeled nucleotides, an extremely important class of modified 
nucleotides is the dideoxynucleotides. The so-called "Sanger" or "dideoxy" DNA 
sequencing method (Sanger et aL, 1977, Proc. Natl. Acad. Sci. USA 74: 5463, which is 
incorporated herein by reference) relies upon the template-directed incorporation of 

20 nucleotides onto an annealed primer by a DNA polymerase from a mixture containing 

deoxy- and dideoxynucleotides. The incorporation of a dideoxynucleotide results in 
chain termination, the inability of the enzyme to catalyze further extension of that strand. 
Electrophoretic separation of reaction products results in a "ladder" of extension products 
wherein each extension product ends in a particular dideoxynucleotide complementary to 

25 the nucleotide opposite it in the template. The distance of the dideoxynucleotide analog 

from the primer is indicated by the length of the extension product. When four reactions, 
each containing one of the four dideoxynucleotide analogs ddA, ddC, ddG, or ddT 
(ddNTPs) are separated on the same gel, the sequence of the template may be read 
directly from the ladder patterns. Extension products may be detected in several ways, 

30 including for example, the inclusion of isotopically- or fluorescently-labeled primers, 

deoxynucleotide triphosphates or dideoxynucleotide triphosphates in the reaction. 

Fluorescent labeling has the advantages of faster data collection, since detection 
may be performed while the gel is running, and longer reads of sequence data from a 
single reaction and gel. Further, fluorescent sequence detection has allowed sequencing 

35 to be performed in a single reaction tube containing four differentially-labeled fluorescent 

dye terminators (the so-called dye-terminator method, Lee et al., 1992, Nucleic Acids 
Res. 20: 2471, incorporated herein by reference). 

A desirable quality of a polymerase useful for DNA sequencing is improved 
incorporation of dideoxynucleotides. Improved incorporation of dideoxynucleotides can 
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make processes such as DNA sequencing more cost effective by reducing the 
requirement for expensive radioactive or fluorescent dye-labeled dideoxynucleotides. 
Moreover, unbiased dideoxynucleotide incorporation provides improved signal 
uniformity, leading to increased accuracy of base determination. The even signal output 
further allows subtle sequence differences caused by factors like allelic variation to be 
detected. Allelic variation, which produces two different half strength signals at the 
position of relevance, can easily be concealed by the varied signal strengths caused by 
polymerases with non-uniform ddNTP utilization. 

Incorporation of ribonucleotides by the native form of DNA polymerase is a rare 
event. Mutants that incorporate higher levels of ribonucleotides can be used for 
application s such as sequencing by partial ribosubstitution. In this system, a mixture of 
ribonucleotides and deoxynucleotides corresponding to the same base are incorporated by 
the mutant polymerase (Barnes, 1978 J. Mol. Biol 119:83-99). When the ribosequencing 
reactions are exposed to alkaline conditions and heat, fragmentation of the extended 
strand occurs. If the reactions for all four bases are separated on a denaturing acrylamide 
gel, they produce a sequencing ladder, there is a need in the art for polymerase mutants 
with higher utilization of ribonucleotides for this alternative method of sequencing. 

Alternatively, the incorporation of ribonucleotides followed by alkaline 
hydrolysis could be utilized in a system that requires random cleavage of DNA molecules 
such as DNA shuffling ((Stemmer, 1994, Nature, 370: 389-391) which has also been 
called molecular breeding, sexual PCR and directed evolution). 

Another desirable quality in a DNA labeling enzyme is thermal stability. DNA 
polymerases exhibiting thermal stability have revolutionized many aspects of molecular 
biology and clinical diagnostics since the development of the polymerase chain reaction 
(PCR), which uses cycles of thermal denaturation, primer annealing, and enzymatic 
primer extension to amplify DNA templates. The prototype thermostable DNA 
polymerase is Taq polymerase, originally isolated from the thermophilic eubacterium 
Thermus aquaticus. So-called "cycle sequencing" reactions using thermostable DNA 
polymerases have the advantage of requiring smaller amounts of starting template 
relative to conventional (i.e., non-cycle) sequencing reactions. 

There are three major families of DNA polymerases, termed families A, B and C. 
The classification of a polymerase into one of these three families is based on structural 
similarity of a given polymerase to E. coli DNA polymerase I (Family A), II (Family B) 
or III (family C). As examples, Family A DNA polymerases include, but are not limited 
to Klenow DNA polymerase, Thermus aquaticus DNA polymerase I (Taq polymerase) 



WO 01/32887 



PCT/US00/29706 



and bacteriophage T7 DNA polymerase; Family B DNA polymerases, formerly known as 
-family polymerases (Braithwaite and Ito, 1991, Nuc. Acids Res. 19:4045), include, but 
are not limited to human , and DNA polymerases, T4, RJB69 and 29 bacteriophage 
DNA polymerases, and Pyrococcus furiosus DNA polymerase (Pfu polymerase); and 
family C DNA polymerases include, but are not limited to Bacillus subtilis DNA 
polymerase III, and E. coli DNA polymerase III and subunits (listed as products of 
the dnaE and dnaQ genes, respectively, by Brathwaite and Ito, 1993, Nucleic Acids Res. 
2 1 : 787). An alignment of DNA polymerase protein sequences of each family across a 
broad spectrum of archaeal, bacterial, viral and eukaryotic organisms is presented in 
Braithwaite and Ito (1993, supra), which is incorporated herein by reference. 

The term used to describe the tendency of DNA polymerases to not to carry the 
incorporation of unnatural nucleotides into the nascent DNA polymer is "discrimination". 
In Family A DNA polymerases, the effective discrimination against incorporation of 
dideoxynucleotide analogs is largely associated with a single amino acid residue. The 
majority of en2ymes from the Family A DNA polymerases have a phenylalanine (phe or 
F) residue at the position equivalent to F762 in E. coli Klenow fragment of DNA 
polymerase and demonstrate a strong discrimination against dideoxynucleotides. A few 
polymerases (e.g. T7 DNA polymerase) have a tyrosine (tyr or Y) residue at the 
corresponding position and exhibit relatively weak discrimination against 
dideoxynucleotides. Family A polymerases with tyrosine at this position readily 
incorporate dideoxynucleotides at levels equal to or only slightly different from the levels 
at which they incorporate deoxynucleotides. Conversion of the tyrosine or phenylalanine 
residues in the site responsible for discrimination reverses the dideoxynucleotide 
discrimination profile of the Family A enzymes (Tabor and Richardson, 1995, Proc. Natl. 
Acad. Sci. USA 92:6449). 

Among the thermostable DNA polymerases, 'a mutant form of the Family A DNA 
polymerase from Thermus aqaticus, known as AmpliTaq FS® (Perkin Elmer), contains a 
F667Y mutation at the position equivalent to F762 of Klenow DNA polymerase and 
exhibits increased dideoxynucleotide uptake (i.e., reduced discrimination against 
ddNTPs) relative to the wild-type enzyme. The reduced discrimination for 
dideoxynucleotide uptake makes it more useful for fluorescent and labeled 
dideoxynucleotide sequencing than the wild-type enzyme. 

The F667Y mutant of Taq DNA polymerase is not suited, however, for use with 
fluorescein-labeled dideoxynucleotides, necessitating the use of rhodamine dye 
terminators. Rhodamine dye terminators that are currently utilized with Taq sequencing 
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reactions, however, stabilize DNA secondary structure, causing compression of signal. 
Efforts to eliminate compression problems have resulted in systems that use high 
amounts of the nucleotide analog deoxyinosine triphosphate (dITP) in place of 
deoxyguanosine triphosphate. While incorporation of (dITP) reduces the compression of 
the signal, the presence of dITP in the reaction produces additional complications 
including lowered reaction temperatures and increased reaction times. Additionally, the 
use of rhodamine dyes in sequencing requires undesirable post-reaction purification 
(Brandis, 1999 Nuc. Acid Res. 27:1912). 

Family B DNA polymerases exhibit substantially different structure compared to 
Family A DNA polymerases, with the exception of the position of acidic residues 
involved in catalysis in the so-called palm domain (Wang et al., 1997, Cell 89:1087; 
Hopfher et al., 1999, Proc. Natl. Acad. Sci. USA 96:3600). The unique structure of 
Family B DNA polymerases may permit a completely different spectrum of interactions 
with nucleotide analogs, perhaps allowing utilization of analogs which are unsuitable for 
use with Family A DNA polymerases due to structural constraints. Thermostable Family 
B DNA polymerases have been identified in hyperthermophilic archaea. These organisms 
grow at temperatures higher than 90 1 C and their enzymes demonstrate greater 
themostability (Mathur et al., 1992, Stratagies 5:11) than the thermophilic eubacterial 
Family A DNA polymerases. Family B polymerases from hyperthermophilic archaea 
may be well suited starting substrates for modification(s) to reduce discrimination against 
non-conventional nucleotides. 

Although the crystal structures of three Family B DNA polymerases have been 
solved (Wang et al., 1997, supra; Hopfher, K.-P. et al., 1999, Proc. Natl. Acad. Sci. 96: 
3600; Zhao, 1999, Structure Fold Des., 7:1 189), the structures of DNA-polymerase or 
dNTP-polymerase co-complexes have not yet been reported. At present, identification of 
amino acid residues contributing to nucleotide analog discrimination can only be inferred 
from extrapolation to Family A-dNTP structures or from mutagenesis studies carried out 
with related Family B DNA polymerases (e.g., human pol , phage T4, phage 29, T. 
litoralis DNA polymerase). 

Sequence comparison of the Family B DNA polymerases indicate six conserved 
regions numbered I-VI (Braithwaite and Ito, 1993, supra). The crystal structure of 
bacteriophage RB69 DNA polymerase (Family B) proposed by Wang et al. (Wang et al., 
1997, supra) shows that Y416 in region II ( which corresponds to Y409 in the Family B 
DNA polymerase of Thermococcus species JDF-3) has the same position as Yl 15 in HIV 
reverse transcriptase (RT) and E710 in the Klenow fragment (Family A polymerases). 
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5 Modeling of the dNTP and primer template complex in RB69 was carried out using the 

atomic coordinates of the reverse transcriptase-DNA cocrystal. This model predicts the 
RB69 Y416 packs under the deoxyribose portion of the dNTP. Tyrosine at this position 
has been implicated in ribose selectivity, contributing to polymerase discrimination 
between ribonucleotides and deoxribonucleotides in mammalian reverse transcriptases 

10 (Yl 15) (Gao et al., 1997, Proc. Natl Acad. Sci. USA 94:407; Joyce, 1994, Proc. Natl. 

Acad. Sci. USA 94:1619) and in Family A DNA polymerases where modification of the 
corresponding invariable glutamate residue (E710) reduces discrimination against 
ribonucleotides (Gelfand et al., 1998, Pat. No. EP0823479; Astatke et al., 1998, Proc. 
Natl. Acad. Sci. USA 96:3402). 

15 Mutagenesis studies done in Family B DNA polymerases also implicate the 

region containing the analogous Y in region II in dNTP incorporation and ribose 
selectivity. Mutations at the corresponding Y865 in human DNA polymerase affect 
polymerase fidelity and sensitivity to dNTP nucleotide inhibitors such as AZT-TP, which 
has a bulky 3'-azido group in place of the 3'-OH group, BuPdGTP, which contains a 

20 butylphenyl group attached to the amino group at the C-2 position in the guanine base of 

dGTP (resulting in a bulkier and more hydrophobic purine base nucleotide) and 
aphidicolin, a competitive inhibitor of pyrimidine deoxynucleotide triphosphate. 
Interestingly, the mutants showed no difference in their uptake of ddCTP (Dong et al., 
1993, J. Biol. Chem. 268: 26143). Additionally, mutants of bacteriophage T4 DNA 

25 polymerase, which have converted L412 to methionine (M) or isoleucine (I) just one 

amino acid before the analogous Y (Y41 1), show extreme and mild sensitivity, 
respectively, to the inorganic pyrophosphate analog phosphonoacetic acid (PAA). 
Alterations in PAA sensitivity have been shown to predict polymerase interactions with 
nucleotide analogs. L412 in T4 DNA polymerase corresponds to L410 in Thermococcus 

30 species JDF-3 DNA polymerase. The L412M T4 DNA polymerase mutant was inhibited 

with 50-fold less ddGTP than wild-type polymerase while the K m s for dGTP was similar. 
As stated by the authors in that study, "[d]espite the sensitivity of the L412M DNA 
polymerase to ddGTP, there was no difference found in the incorporation of ddNTPs by 
wild-type and L412M DNA polymerase." (Reha-Krantz et al., 1993, J. Virol. 67:60). In 

35 bacteriophage 29, mutations in region II (LYP where Y is analogous to Thermococcus 

species JDF3 DNA polymerase Y409) produce mixed results when challenged with 
PAA; P255S was hypersensitive to PAA while L253V was shown to be less sensitive 
than the wild-type enzyme (Blasco et al., 1993, J. Biol. Chem. 268: 24106). These data 
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5 support the role of the LYP region (region II) in polymerase-nucleotide interactions, but 

improved incorporation of ddNTPs was not achieved in these references. 

In another study, extensive mutation of region II in the archaeal Family B DNA 
polymerase from Thermococcus litoralis DNA polymerase (VENT™ polymerase, New 
England Biolabs) was performed. In that study, 26 different site-directed mutants were 

1 0 made for the sole intent of examining nucleotide analog discrimination (Gardner and 

Jack, 1999, Nucleic Acids Res. 27: 2545). Site-directed mutagenesis of VENT™ DNA 
polymerase demonstrated that three mutations at Y412 (which corresponds to JDF-3 
DNA polymerase Y409) could alter nucleotide binding (Gardner and Jack, 1999, supra). 
Y412V was most significant with a 2 fold increase in dideoxynucleotide incorporation 

15 and a 200 fold increase in the incorporation of ribonucleotide ATP. The mutation Y412F 

showed no change in analog incorporation. 

Region III of the Family B polymerases (also referred to as motif B) has also been 
demonstrated to play a role in nucleotide recognition. This region, which corresponds to 
AA 487 to 495 of JDF-3 Family B DNA polymerase, has a consensus sequence 

20 KX 3 NSXYG (Jung et al., 1990, supra; Blasco et al., 1992, supra; Dong et al., 1993, J. 

Biol. Chem. 268:21 163; Zhu et al., 1994, Biochem. Biophys. Acta 1219:260; Dong and 
Wang, 1995, J. Biol. Chem. 270:21563), and is functionally, but not structurally (Wang et 
al., 1997, supra), analogous to KX 3 (F/Y)GX 2 YG in helix O of the Family A DNA 
polymerases. In Family A DNA polymerases, such as the Klenow fragment and Taq 

25 DNA polymerases, the O helix contains amino acids that play a major role in dNTP 

binding (Astatke et al., 1998, J. Mol. Biol. 278:147; Astatke et al., 1995, J. Biol. Chem. 
270:1945; Polesky et al., 1992, J. Biol. Chem 267:8417; Polesky et al., 1990, J. Biol. 
Chem. 265:14579; Pandey et al., 1994, J. Biol. Chem. 269:13259; Kaushik et al., 1996, 
Biochem. 35:7256). Specifically, helix O contains the F (F763 in the Klenow fragment; 

30 F667 in Taq) which confers ddNTP discrimination in Family A DNA polymerases 

(KX 3 (F/Y)GX 2 YG) (Tabor and Richardson, 1995, supra). 

Directed mutagenesis studies in region III of VENT™ DNA polymerase also 
targeted an alanine analogous to A485 of the Thermococcus species JDF-3 DNA 
polymerase (Jung et al., 1990, supra). These mutants (A-»C, A— >S, A->L, A A->F 

35 and A->V) exhibited a range of specific activities from 0.12 to 1 .2 times the polymerase 

activity of the progenitor enzyme (Gardner and Jack, 1999, Nucl. Acids Res. 27:2545). 
The dideoxynucleotide incorporation ranged from 4 to 15 times the unmutated enzyme. 
Interestingly, the mutant with the highest dideoxynucleotide incorporation (15x) had a 
specific activity of only 0.1 2x of the original enzyme. 
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5 Site-directed mutagenesis studies on the Family B DNA polymerase from 

Thermococcus barossii modified each residue independently in the sequence ILANSF, 
which corresponds to AA residues 488-493 of the JDF-3 DNA polymerase, to tyrosine 
(Reidl et al., U.S. Patent No. 5,882,904). That study indicated that an L489Y mutant 
exhibits approximately 3 times greater incorporation of dideoxynucleotides relative to an 

1 0 enzyme bearing the wild-type leucine residue at this site. 

One area of active research involves the use of nucleic acid arrays, often referred 
to as nucleic acid or DNA "chips", in the simultaneous analyses of multiple different 
nucleic acid sequences. Many of these applications, such as those described in U.S. 
Patent No. 5,882,904 (Reidl et al., issued March 16, 1999) will benefit from DNA 

15 polymerases exhibiting reduced discrimination against non-conventional nucleotides, 

particularly fluorescently-labeled non-conventional nucleotides. Applications being 
addressed in the chip format include DNA sequencing and mutation detection, among 
others. For example, the "mini-sequencing" methods (e.g., Pastinen et al., 1997, Genome 
Res. 7: 606; Syvanen, 1999, Human Mutation 13: 1-10) and the arrayed primer extension 

20 (APEX) mutation detection method (Shumaker et al., 1996, Hum. Mutat. 7: 346) and 

methods like them can benefit from DNA polymerases with reduced discrimination 
against fluorescently-labeled or other non-conventional nucleotides. There is a need in 
the art for a non-discriminating DNA polymerase for use in chip or gel based mini- 
sequencing systems. Such a system would advantageously permit detection of 

25 multiplexed single nucleotide polymorphisms (SNPs) and allow for quantitative 

genotyping. Identification of sequence variation permits the diagnosis and treatment of 
genetic disorders, predisposition to multifactorial diseases, and sensitivity to new or 
existing pharmaceutical products. 

There is a need in the art for DNA polymerases with reduced discrimination 

30 against unconventional nucleotides. There is particularly a need in the art for 

thermostable DNA polymerases exhibiting reduced discrimination against 
dideoxynucleotides, and further, for DNA polymerases exhibiting reduced discrimination 
against fluorescently labeled dideoxynucleotides. 

SUMMARY OF THE INVENTION 

35 The present invention relates to compositions and methods utilizing DNA 

polymerase enzymes exhibiting reduced discrimination against non-conventional 
nucleotides. Enzymes with this quality are useful in many applications calling for the 
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5 detectable labeling of nucleic acids and are particularly useful in DNA sequencing 

applications. 

The invention further relates to a Family B DNA polymerase having one or more 
mutations at a site or sites corresponding to L408, P410, S345, and/or A485 of SEQ ID 
NO: 2, or a fragment thereof which retains the ability to direct the template-dependent 
10 polymerization of nucleic acid. The invention also encompasses mutants and modified 

versions (e.g., reversibly inactivated versions of a Family B polymerase prepared, for 
example, by chemical modification or antibody complexing) of a Family B polymerase 
mutated at sites corresponding to L408, P410 and or A485 of SEQ ID NO: 2. 

In one embodiment, the DNA polymerase has a dual mutation comprising 
15 comprising a serine to proline mutation at a site corresponding to S345 of SEQ ID NO: 2; 

and a proline to leucine mutation at a site corresponding to P410 of SEQ ID NO: 2. 

The invention encompasses purified thermostable DNA polymerase having an 
amino acid sequence presented in SEQ ID NO: 2 from residue 1 to 776. 

In one embodiment, the thermostable DNA polymerase is isolated from 
20 Thermococcus species JDF-3. 

In another embodiment, the thermostable polymerase is isolated from a 
recombinant organism transformed with a vector that codes for the expression of 
Thermococcus species JDF-3 DNA polymerase. 

The invention further encompasses a recombinant vector comprising the 
25 nucleotide sequence presented in SEQ ID NO: 1 . 

The invention further encompasses an isolated recombinant polypeptide 
comprising the amino acid sequence of SEQ ID NO: 2 or a functional fragment thereof. 

The invention further encompasses an isolated recombinant DNA polymerase 
from Thermococcus species JDF-3 that is 3' to 5' exonuclease deficient. 
30 In one embodiment, the isolated recombinant DNA polymerase of has an aspartic 

acid to threonine or alanine mutation at the amino acid corresponding to D 141 of SEQ ID 
NO: 2 or a glutamic acid to alanine mutation at the amino acid corresponding to El 43 of 
SEQ ID NO: 2. 

In another embodiment, the isolated recombinant DNA polymerase has an 
35 aspartic acid to threonine or alanine mutation at the amino acid corresponding to D 141 of 

SEQ ID NO: 2 and a glutamic acid to alanine mutation at the amino acid corresponding 
toE143 of SEQ ID NO: 2. 

The invention further encompasses an isolated recombinant DNA polymerase 
having reduced discrimination against non-conventional nucleotides. 
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In one embodiment, the DNA polymerase is a Family B DNA polymerase. 

In another embodiment, the DNA polymerase further comprises a mutation 
selected from the group consisting of: a leucine to histidine mutation at a site 
corresponding to L408 of SEQ ID NO: 2; a leucine to phenylalanine mutation at a site 
corresponding to L408 of SEQ ID NO: 2; a proline to leucine mutation at a site 
corresponding to P410 of SEQ ID NO: 2; and an alanine to threonine mutation at a site 
corresponding to A485 of SEQ ID NO: 2. 

The invention further encompasses an isolated recombinant DNA polymerase 
having the alanine to threonine mutation at the site corresponding to A485 of SEQ ID 
NO:2 further comprising a mutation selected from the group consisting of: a leucine to 
histidine mutation at a site corresponding to L408 of SEQ ID NO: 2; a leucine to 
phenylalanine mutation at a site corresponding to L408 of SEQ ID NO: 2; and a proline 
to leucine mutation at a site corresponding to P410 of SEQ ID NO: 2. 

The invention further encompasses an isolated recombinant DNA polymerase 
having the a proline to leucine mutation at a site corresponding to P410 of SEQ ID NO: 
2, further comprising of serine to proline mutation at a site corresponding to S345 of SEQ 
ID NO: 2 

In another embodiment, the DNA polymerase has reduced discrimination against 
a non-conventional nucleotide selected from the group consisting of: dideoxynucleotides, 
ribonucleotides and conjugated nucleotides. 

In another embodiment, conjugated nucleotide is selected from the group 
consisting of radiolabeled nucleotides, fluorescently labeled nucleotides, biotin labeled 
nucleotides, chemiluminescently labeled nucleotides and quantum dot labeled 
nucleotides. 

The invention further encompasses an isolated recombinant Family B DNA 
polymerase comprising an alanine to threonine mutation at the site corresponding to 
A485 of SEQ ID NO: 2 or a mutation at a site corresponding to L408 or P410 of SEQ ID 
NO: 2, wherein the DNA polymerase has reduced discrimination against non- 
conventional nucleotides relative to the wild-type form of that polymerase. 

In one embodiment, the Family B DNA polymerase is 3' to 5' exonuclease 
deficient. 

In another embodiment, the Family B DNA polymerase has a mutation at an 
amino acid corresponding to D141 or E143 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase has an aspartic acid to 
threonine or alanine mutation at a site corresponding to D 141 of SEQ ID NO: 2. 
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5 In another embodiment, the Family B DNA polymerase has a glutamic acid to 

alanine mutation at a site corresponding to El 43 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase has a glutamic acid to 
alanine mutation at a site corresponding to El 43 of SEQ ID NO: 2 and has an aspartic 
acid to threonine or alanine mutation at the amino acid corresponding to D 141. of SEQ ID 
10 NO: 2. 

In another embodiment, the Family B DNA polymerase is thermostable. 
In another embodiment, the Family B DNA polymerase is archaeal. 
In another embodiment, the Family B DNA polymerase comprises a leucine to 
histidine mutation at a site corresponding to L408 of SEQ ID NO: 2. 
15 In another embodiment, the Family B DNA polymerase comprises a leucine to 

phenylalanine mutation at a site corresponding to L408 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprises a proline to 
leucine mutation at a site corresponding to P410 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprises an alanine to 
20 threonine mutation at a site corresponding to A485 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprising an alanine to 
threonine mutation at a site corresponding to A485 of SEQ ID NO: 2 comprises a leucine 
to histidine mutation at a site corresponding to L408 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprising an alanine to 
25 threonine mutation at a site corresponding to A485 of SEQ ID NO: 2 comprises a leucine 

to phenylalanine mutation at a site corresponding to L408 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprising an alanine to 
threonine mutation at a site corresponding to A485 of SEQ ID NO: 2 comprises a proline 
to leucine mutation at a site corresponding to P410 of SEQ ID NO: 2. 
30 In another embodiment, the Family B DNA polymerase comprising a proline to 

leucine mutation at a site corresponding to P410 of SEQ ID NO: 2, further having a 
serine to proline mutation at a site corresponding to S345 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprises a serine to 
proline mutation at a site corresponding to S345 of SEQ ID NO: 2, and may further 
35 comprise a mutation at a site corresponding to T604 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprises a tyrosine to 
cysteine mutation at a site corresponding to Y497 of SEQ ID NO: 2, and may further 
comprise an isoleucine to valine mutation at a site corresponding to 1630 of SEQ ID 
NO:2. 
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In another embodiment, the Family B DNA polymerase comprises a glutamic acid 
to lysine mutation at a site corresponding to E645 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprises a glutamic acid 
to lysine mutation at a site corresponding to E578 of SEQ ID NO: 2, and may further 
comprise an arginine to methionine mutation at a site corresponding to R465 of SEQ ID 
NO: 2. 

In another embodiment, the Family B DNA polymerase comprises a leucine to 
glutamine mutation at a site corresponding to L396 of SEQ ID NO: 2, and may further 
comprise a mutation at a site corresponding to V401, N424, P569, E617, or V640 of SEQ 
ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprises a serine to 
asparagene mutation at a site corresponding to S65 1 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprises a leucine to 
proline mutation at a site corresponding to L396 of SEQ ID NO: 2, and may further 
comprise a mutation at a site corresponding to E459 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprises a leucine to 
proline mutation at a site corresponding to L456 of SEQ ID NO: 2, and may further 
comprise a mutation at a site corresponding to E658 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprises a leucine to 
histidine mutation at a site corresponding to L408 of SEQ ID NO: 2, and may further 
comprise a mutation at a site corresponding to V437, or L478 of SEQ ID NO: 2. The 
L408H mutation was isolated both in the dideoxynucleotide and the dye- 
dideoxynucleotide screens described herein. 

In another embodiment, the Family B DNA polymerase comprises an tyrosine to 
asparagine mutation at a site corresponding to Y496 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase has reduced 
discrimination against a non-conventional nucleotide selected from the group consisting 
of: dideoxynucleotides, ribonucleotides and conjugated nucleotides. 

In another embodiment, the conjugated nucleotide is selected from the group 
consisting of radiolabeled nucleotides, fluorescently labeled nucleotides, biotin labeled 
nucleotides, chemiluminescently labeled nucleotides and quantum dot labeled 
nucleotides. 

In another embodiment, an isolated recombinant DNA polymerase having 
reduced discrimination against non-conventional nucleotides or an isolated recombinant 
Family B DNA polymerase comprising an alanine to threonine mutation at the site 
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5 corresponding to A485 of SEQ ID NO: 2 or a mutation at a site corresponding to L408 or 

P410 of SEQ ID NO: 2, wherein the DNA polymerase has reduced discrimination against 
non-conventional nucleotides relative to the wild-type form of that polymerase further 
comprises a mutation at an amino acid residue in the polymerase that corresponds to a 
mutation selected from the group consisting of: a Y to V mutation at amino acid 409 of 
10 SEQ ID NO:2; an A to C, S, L, I, F, or V mutation at amino acid 485 of SEQ ID NO: 2; a 

Y to S mutation at amino acid 494 of SEQ ID NO: 2; a Y to L mutation at amino acid 
496 of SEQ ID NO: 2; and an A to Y mutation at amino acid 490 of SEQ ID NO: 2. 

In another embodiment, an isolated recombinant DNA polymerase having 
reduced discrimination against non-conventional nucleotides or an isolated recombinant 
1 5 Family B DNA polymerase comprising an alanine to threonine mutation at the site 

corresponding to A485 of SEQ ID NO: 2 or a mutation at a site corresponding to L408 or 
P410 of SEQ ID NO: 2, wherein the DNA polymerase has reduced discrimination against 
non-conventional nucleotides relative to the wild-type form of that polymerase further 
comprises a mutation at an amino acid of the polymerase corresponding to one of amino 
20 acids 483 to 496, inclusive, of SEQ ID NO: 2. 

In one embodiment, the mutation is at an amino acid of the polymerase 
corresponding to one of amino acids 485, 490, 494, or 496 of SEQ ID NO: 2. 

The invention further encompasses an isolated recombinant Family B DNA 
polymerase comprising an alanine to threonine mutation at an amino acid corresponding 
25 to A485T of SEQ ID NO: 2 and at least one substitution in the polymerase of an amino 

acid corresponding to L408, Y409, or P410, respectively, of SEQ ID NO: 2. 

The invention further encompasses an isolated recombinant Family B DNA 
polymerase comprising an amino acid other than A at an amino acid of the polymerase 
corresponding to A485 of SEQ ID NO: 2, and at least one substitution in the polymerase 
30 of an amino acid corresponding to L408, Y409, or P410, respectively, of SEQ ID NO: 2. 

The invention further encompasses a recombinant vector comprising a nucleic 
acid sequence encoding the Family B DNA polymerase. 

The invention further encompasses a method of labeling a complementary strand 
of DNA, the method comprising the step of contacting a template DNA molecule with a 
35 recombinant Family B DNA polymerase from Thermococcus species JDF-3, wherein the 

DNA polymerase has reduced discrimination against non-conventional nucleotides, and a 
non-conventional nucleotide, under conditions and for a time sufficient to permit the 
DNA polymerase to synthesize a complementary DNA strand and to incorporate the non- 
conventional nucleotide into the synthesized complementary DNA strand. 
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The invention further encompasses a method of labeling a complementary strand 
of DNA, the method comprising the step of contacting a template DNA molecule with a 
recombinant Family B DNA polymerase comprising an alanine to threonine mutation at a 
site corresponding to A485 of SEQ ID NO: 2 or a mutation at a site corresponding to 
L408 or P410 of SEQ ID NO: 2, wherein the DNA polymerase has reduced 
discrimination against non-conventional nucleotides, and a non-conventional nucleotide, 
under conditions and for a time sufficient to permit the DNA polymerase to synthesize a 
complementary DNA strand and to incorporate the non-conventional nucleotide into the 
synthesized complementary DNA strand- 
In one embodiment, the recombinant Family B DNA polymerase is 3' to 5' 
exonuclease deficient. 

In another embodiment, the recombinant Family B polymerase comprises a 
leucine to histidine mutation at a site corresponding to amino acid L408 of SEQ ID NO: 
2. 

In another embodiment, the recombinant Family B polymerase comprises a 
leucine to phenylalanine mutation at a site corresponding to amino acid L408 of SEQ ID 
NO: 2. 

In another embodiment, the recombinant Family B polymerase comprises a 
proline to leucine mutation at a site corresponding to amino acid P410 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase comprises an 
alanine to threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 
2. 

In another embodiment, the recombinant Family B polymerase comprising an 
alanine to threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 
2 comprises a leucine to histidine mutation at an amino acid corresponding to L408 of 
SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase comprising an 
alanine to threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 
2 comprises a leucine to phenylalanine mutation at an amino acid corresponding to L408 
of SEQ ID NO: 2, 

In another embodiment, the recombinant Family B polymerase comprising an 
alanine to threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 
2 comprises a proline to leucine mutation at an amino acid corresponding to P410 of SEQ 
ID NO: 2. 
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5 In another embodiment, the recombinant Family B polymerase has reduced 

discrimination against a non-conventional nucleotide selected from the group consisting 
of: dideoxynucleotides, ribonucleotides, and conjugated nucleotides. 

In another embodiment, the conjugated nucleotide is selected from the group 
consisting of radiolabeled nucleotides, fluorescently labeled nucleotides, biotin labeled 
10 nucleotides, chemiluminescently labeled nucleotides and quantum dot labeled 

nucleotides. 

The invention further encompasses a method of sequencing DNA comprising the 
steps of contacting a DNA strand to be sequenced with a sequencing primer, a 
recombinant Family B DNA polymerase from Thermococcus species JDF-3, wherein the 
15 DNA polymerase has reduced discrimination against non-conventional nucleotides, and a 

chain-terminating nucleotide analog, under conditions that permit the DNA polymerase 
to synthesize a complementary DNA strand, and to incorporate nucleotides into the 
synthesized complementary-DNA strand, wherein incorporation of a chain-terminating 
nucleotide analog results in the termination of chain elongation, such that the nucleotide 
20 sequence of the template DNA strand is determined. 

The invention further encompasses a method of sequencing DNA comprising the 
steps of contacting a DNA strand to be sequenced with a sequencing primer, a 
recombinant Family B DNA polymerase comprising an alanine to threonine mutation at a 
site corresponding to A485 of SEQ ID NO: 2 or a mutation at a site corresponding to 
25 L408, S345 or P41 0 of SEQ ID NO: 2, where the DNA polymerase has reduced 

discrimination against non-conventional nucleotides, and a chain-terminating nucleotide 
analog, under conditions that permit the DNA polymerase to synthesize a 
complementary DNA strand, and to incorporate nucleotides into the synthesized 
complementary DNA strand, wherein incorporation of a chain-terminating nucleotide 
30 analog results in the termination of chain elongation, such that the nucleotide sequence of 

the template DNA strand is determined. 

In one embodiment, the recombinant DNA polymerase is deficient in 3' to 5' 
exonuclease activity. 

In another embodiment, the recombinant Family B polymerase has a leucine to 
35 histidine mutation at a site corresponding to amino acid L408 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase has a leucine to 
phenylalanine mutation at a site corresponding to amino acid L408 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase has a proline to 
leucine mutation at a site corresponding to amino acid P410 of SEQ ID NO: 2. 
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In another embodiment, the Family B DNA polymerase comprising a proline to 
leucine mutation at a site corresponding to P410 of SEQ ID NO: 2, further having a 
serine to proline mutation at a site corresponding to S345 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase has an alanine to 
threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase having an alanine 
to threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 2 has a 
leucine to histidine mutation at a site corresponding to L408 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase having an alanine 
to threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 2 has a 
leucine to phenylalanine mutation at a site corresponding to L408 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase having an alanine 
to threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 2 has a 
proline to leucine mutation at a site corresponding to P410 of SEQ ID NO: 2. 

In another embodiment, the chain-terminating nucleotide analog is a 
dideoxynucleotide. 

In another embodiment, the dideoxynucleotide is detectably labeled. 

In another embodiment, the dideoxynucleotide is fluorescently labeled. 

In another embodiment, the dideoxynucleotide is labeled with a moiety selected 
from the group consisting of fluorescein and rhodamine. 

The invention also encompasses a kit for performing the methods disclosed 

herein. 

The invention also encompasses methods of making a recombinant DNA 
polymerase as disclosed here, comprising culturing a host cell containing a nucleic acid 
sequence encoding said polymerase under conditions which permit production of said 
DNA polymerase. 

The invention encompasses a mixture of a mutant DNA polymerase described 
herein and another DNA polymerase such as Taq DNA polymerase (preferably the 
mutant form, F667Y). Such a mixture is usefil in that it may increase signal uniformity 
generated from polymerization of a labeled nucleotide into a synthetic nucleotide. 

As used herein, "discrimination" refers to the tendency of DNA polymerase to not 
incorporate non-conventional nucleotides into a nascent DNA polymer. DNA 
polymerase has the ability to sense nucleotide structure, including but not limited to 
nucleotide base complementarity, and structural features of the sugar and heterocyclic 
base, thereby allowing DNA polymerase to preferentially utilize conventional 

- 16 - 



WO 01/32887 



PCT/US00/29706 



deoxynucleotides rather than non-conventional nucleotides for incorporation into a 
nascent polymer. DNA polymerase strongly prefers to incorporate the conventional 
deoxynucleotides dATP, dCTP, dGTP and TTP into DNA polymers; the polymerase is 
unlikely to progress with an unconventional nucleotide in its binding pocket. 

As used herein, "reduced discrimination" refers to a reduction of at least 50% in 
the tendency of a DNA polymerase to exclude a non-conventional nucleotide from (that 
is, to not incorporate non-conventional nucleotides into) a nascent DNA polymer, relative 
to a parental or wild type DNA polymerase which does not exhibit reduced 
discrimination. The preference of DNA polymerase to incorporate the conventional 
deoxynucleotides dATP, dCTP, dGTP and TTP rather than non-conventional nucleotides 
into DNA polymers is thereby reduced compared to the natural level of preference, such 
that non-conventional nucleotides are more readily incorporated into DNA polymers by 
DNA polymerase. According to the invention, a polymerase exhibiting reduced 
discrimination will exhibit reduced discrimination against at least one non-conventional 
nucleotides, but may not exhibit reduced discrimination against all non-conventional 
nucleotides. 

According to the invention, discrimination is quantitated by measuring the 
concentration of a non-conventional nucleotide required to inhibit the incorporation of 
the corresponding conventional nucleotide by 50%. This concentration is referred to 
herein as the "Iso%" for a non-conventional nucleotide. Discrimination against a given 
non-conventional nucleotide is "reduced" if the I 50 o/ o for that non-conventional nucleotide 
is reduced by at least two fold (50%) relative to an identical assay containing, in place of 
the mutant DNA polymerase, a parental DNA polymerase. 

Alternatively, reduced discrimination may be quantitated by determining the 
amount of a non-conventional nucleotide (for example, a dideoxynucleotide, 
ribonucleotide, or cordycepin) required in a reaction with a mutant polymerase having 
reduced discrimination to generate a sequencing ladder identical to a sequencing ladder 
produced using the wild-type or parental enzyme. The sequencing ladder can be 
examined, for example, in the range of 1 to 400 bases from the primer terminus, and the 
ladders will be identical in the number of extension products generated as well as the 
lengths of extension products generated in the sequencing reaction. For this type of 
assay, a constant amount of dNTPs and varying amounts of non-conventional nucleotides 
are used to generate a sequencing ladder with both the wild-type (or parental) enzyme 
and the mutant poplymerase (for ribonucleotides, a sequencing ladder is generated by 
alkali cleavage of the polymerization products). See Gardner & Jack, 1999, supra. A 
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mutant exhibits reduced discrimination if it requires at least two-fold (50%) less, five- 
fold (80%)less, ten-fold (100%) less, etc. of the amount of the non-conventional 
nucleotide used by the wild-type or parental polymerase to produce a sequencing ladder 
identical (with respect to the number and length of extension products generated) to that 
generated by the wild-type or parental enzyme. 

As used herein, the term "parental" or "progenitor" refers to a polymerase used as 
the starting material in generating a mutant polymerase having reduced discrimination. 
The term "parental" is meant to encompass not only a so-called "wild-type" enzyme as it 
occurs in nature, but also intermediate forms, for example, an exonuclease deficient 
enzyme that is used as the starting material for generating an enzyme with reduced 
discrimination against non-conventional nucleotides. 

As used herein, "non-conventional nucleotide" refers to a) a nucleotide structure 
that is not one of the four conventional deoxynucleotides dATP, dCTP, dGTP, and TTP 
recognized by and incorporated by a DNA polymerase, b) a synthetic nucleotide that is 
not one of the four conventional deoxynucleotides in (a), c) a modified conventional 
nucleotide, or d) a ribonucleotide (since they are not normally recognized or incorporated 
by DNA polymerases) and modified forms of a ribonucleotide. Non-conventional 
nucleotides include but are not limited to those listed in Table III, which are 
commercially available, for example, from New England Nuclear. Any one of the above 
non-conventional nucleotides may be a "conjugated nucleotide", which as used herein 
refers to nucleotides bearing a detectable label, including but not limited to a fluorescent 
label, isotope, chemiluminescent label, quantum dot label, antigen, or affinity moiety. 

As used herein, the term "cell", "cell line" and "cell culture" can be used 
interchangeably and all such designations include progeny. Thus, the words 
"transformants" or "transformed cells" includes the primary subject cell and cultures 
derived therefrom without regard for the number of transfers. It is also understood that 
all progeny may not be precisely identical in DNA content, due to deliberate or 
inadvertent mutations. Mutant progeny that have the same functionality as screened for 
in the originally transformed cell are included. 

As used herein, the term "organism transformed with a vector" refers to an 
organism carrying a recombinant gene construct. 

As used herein, "thermostable" refers to a property of a DNA polymerase, such 
that the enzyme active at elevated temperatures and is resistant to DNA duplex- 
denaturing temperatures in the range of about 93°C to about 97 °C. "Active" means the 
enzyme retains the ability to effect primer extension reactions when subjected to elevated 
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or denaturing temperatures for the time necessary to effect denaturation of double- 
stranded nucleic acids. Elevated temperatures as used herein refer to the range of about 
70°C to about 75 °C, whereas non-elevated temperatures as used herein refer to the range 
of about 35°C to about 50 °C. 

As used herein, "archaeal" refers to an organism or to a DNA polymerase from an 
organism of the kingdom Archaea. 

As used herein, "sequencing primer" refers to an oligonucleotide, whether natural 
or synthetic, which serves as a point of initiation of nucleic acid synthesis by a 
polymerase following annealing to a DNA strand to be sequenced. A primer is typically 
a single-stranded oligodeoxyribonucleotide. The appropriate length of a primer depends 
on the intended use of the primer, but for DNA sequencing applications typically ranges 
from about 1 5 to about 40 nucleotides in length. 

As used herein, "Family B DNA polymerase" refers to any DNA polymerase that 
is classified as a member of the Family B DNA polymerases, where the Family B 
classification is based on structural similarity to E. coli DNA polymerase II. The Family 
B DNA polymerases, formerly known as -family polymerases, include, but are not 
limited to those listed as such in Table I. 

As used herein, "Family A DNA polymerase" refers to any DNA polymerase that 
is classified as a member of the Family A DNA polymerases, where the Family A 
classification is based on structural similarity to E. coli DNA polymerase I. Family A 
DNA polymerases include, but are not limited to those listed as such in Table I. 

As used herein, "3' to 5' exonuclease deficient" or "3' to 5' exo" refers to an 
enzyme that substantially lacks the ability to remove incorporated nucleotides from the 3 ' 
end of a DNA polymer. DNA polymerase exonuclease activities, such as the 3 ' to 5 ' 
exonuclease activity exemplified by members of the Family B polymerases, can be lost 
through mutation, yielding an exonuclease-deficient polymerase. As used herein, a DNA 
polymerase that is deficient in 3' to 5' exonuclease activity substantially lacks 3' to 5' 
exonuclease activity. "Substantially lacks" encompasses a complete lack of activity, or a 
"substantial" lack of activity. "Substantial" lack of activity means that the 3' exonuclease 
activity of the mutant polymerase relative to the parental polymerase is 0.03%, and also 
may be 0.05%, 0.1%, 1%, 5%, 10%, or 20%, but is not higher than 50% of the 3' 
exonuclease activity of the parental or wild type polymerase. 

As used herein, "mutation" refers to a change introduced into a starting parental 
DNA sequence that changes the amino acid sequence encoded by the DNA. The 
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consequences of a mutation include but are not limited to the creation of a new character, 
property, function, or trait not found in the protein encoded by the parental DNA. 

As used herein, "wild- type" refers to the typical state of an organism, strain, gene, 
protein or characteristic as it occurs in nature. The wild-type is therefore the natural state 
that is distinguished from a mutant, which was derived from the wild type by introduction 
of change(s) to the wild-type. 

As used herein, "corresponding" refers to sequence similarity in a comparison of 
two or more nucleic acids or polypeptides, where functionally equivalent domains or sub- 
sequences are identified; such functionally equivalent domains or sub-sequences or 
amino acids within such a domain or sub-sequence are said to "correspond". That is, two 
or more sequences are compared through a comparative alignment analysis in which an 
entire sequence is examined for regions of sequence that are similar or identical, and thus 
regions likely to be functionally equivalent to regions from the other sequence(s) are 
identified. 

As used herein in reference to comparisons of an amino acid, amino acid 
sequence, or protein domain, the term "similar" refers to amino acids or domains that 
although not identical, represent "conservative" differences. By "conservative" is meant 
that the differing amino acid has like characteristics with the amino acid in the 
corresponding or reference sequence. Typical conservative substitutions are among Ala, 
Val, Leu and He; among Ser and Thr; among the acidic residues Asp and Glu; among 
Asn and Gin; and among the basic residues Lys and Arg; or aromatic residues Phe and 
Tyr. In calculating the degree (most often as a percentage) of similarity between two 
polypeptide sequences, one considers the number of positions at which identity or 
similarity is observed between corresponding amino acid residues in the two polypeptide 
sequences in relation to the entire lengths of the two molecules being compared. 

As used herein, the term "functionally equivalent" means that a given motif, 
region, or amino acid within a motif or region performs the same function with regard to 
the overall function of the enzyme as a motif, region or amino acid within a motif or 
region performs in another enzyme. 

As used herein, "chain terminating nucleotide analog" refers to a nucleotide 
analog that once incorporated cannot serve as a substrate for subsequent extension by a 
DNA polymerase, thereby terminating the elongation of a DNA polymer by a DNA 
polymerase. Such a nucleotide analog typically lacks a hydroxyl group on its sugar 
moiety to which DNA polymerase can synthesize a phosphodiester bond with an 
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incoming nucleotide. Chain terminating nucleotide analogs are a subset of non- 
conventional nucleotides, and include but are not limited to dideoxynucleotides. 

As used herein, "detectably labeled" refers to a structural modification that 
incorporates a functional group (label) that can be readily detected by various means. 
Compounds that can be detectably labeled include but are not limited to nucleotide 
analogs. Detectable nucleotide analog labels include but are not limited to fluorescent 
compounds, isotopic compounds, chemiluminescent compound, quantum dot labels, 
biotin, enzymes, electron-dense reagents, and haptens or proteins for which antisera or 
monoclonal antibodies are available. The various means of detection include but are not 
limited to spectroscopic, photochemical, biochemical, immunochemical, or chemical 
means. 

As used herein in reference to a polynucleotide or polypeptide, the term "isolated" 
means that a naturally occurring sequence has been removed from its normal cellular 
environment or is synthesized in a non-natural environment (e.g. , artificially 
synthesized). Thus, the sequence may be in a cell-free solution or placed in a different 
cellular environment. The term does not imply that the sequence is the only nucleotide or 
polypeptide chain present, but that it is essentially free (about 90-95% pure at least) of 
non-nucleotide or non-polypeptide material, respectively, naturally associated with it. 

As used herein, the term "recombinant" refers to a polynucleotide or polypeptide 
that is altered by genetic engineering (i.e., by modification or manipulation of the genetic 
material encoding that polynucleotide or polypeptide). 

The invention encompasses full length mutant DNA polymerases, as described 
herein, as well as a functional fragment of a mutant polymerase, that is, a fragment of a 
DNA polymerase that is less than the entire amino acid sequence of the mutant 
polymerase and retains the ability, under at least one set of conditions, to catalyze the 
polymerization of a polynucleotide. Such a functional fragment may exist as a separate 
entity, or it may be a constituent of a larger polypeptide, such as a fusion protein. 

As used herein, the term "complementary DNA strand" refers to that DNA 
molecule synthesized from a template DNA molecule by a DNA polymerase in a primer 
extension reaction. 

As used herein, the term "template DNA molecule" refers to that strand of a 
nucleic acid from which a complementary nucleic acid strand is synthesized by a DNA 
polymerase, for example, in a primer extension reaction. 



-21 - 



WO 01/32887 



PCT/US00/29706 



Further features and advantages of the invention will become more fully apparent 
in the following description of the embodiments and drawings thereof, and from the 
claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the DNA sequence encoding Thermococcus species JDF-3 DNA 
polymerase (intein removed) (SEQ ID NO: 1). 

Figure 2 shows the amino sequence of Thermococcus species JDF-3 DNA 
polymerase (intein removed) (SEQ ID NO: 2). 

Figure 3 shows the amino acid sequence of the genomic clone encoding 
Thermococcus species JDF-3 DNA polymerase (SEQ ID NO: 3). The position of an 
intein, removed by post-translational processing, is shown. 

Figure 4 shows the DNA sequence of the genomic clone encoding Thermococcus 
species JDF-3 DNA polymerase (SEQ ID NO: 4). DNA sequences are shown which 
correspond to 5' and 3' untranslated regions, polymerase-coding regions (exteins), and an 
intein-coding region. 

Figure 5 shows nucleotide incorporation by JDF-3 mutants. Lambda phage clones 
which incorporated 33 P-labeled ddNTPs in the primary library screen were rescreened to 
assess 33 P-ddNTP incorporation in the presence of: (panel 1) 0.5mM MnCl 2 or (panel 2) 
1 .5mM MgCl2. Polymerase activity was measured using 33 P-dNTPs in the presence of 
1.5mM MgCl 2 (panel 3). Nucleotide utilization is shown for clones 1-18 and for the 
parental #550 clone. 

Figure 6 shows 33 P-ddNTP cycle sequencing reactions performed using JDF-3 
polymerase mutants. Purified JDF-3 mutants were substituted into the Thermo Sequenase 
radiolabeled terminator cycle sequencing kit. DNA sequencing ladders were generated as 
per the kit's instructions using the following polymerases: (A) Thermo Sequenase (B) 
JDF-3 #550 clone (parental) (C) JDF-3 A485T mutant (clone pi 2) (D) JDF-3 P410L 
mutant (clone pi 1) (E) JDF-3 P410L mutant (clone p8). The top of the original 
sequencing gel is shown on the side. The lanes are: (bottom) ddGTP, ddATP, ddTTP, 
ddCTP (top). Clones p8, pi 1, and pl2 contain ancillary mutations and an amino-terminal 
tag. 

Figure 7 shows cycle sequencing reactions performed using dye-labeled 
ddNTPs and JDF-3 polymerase mutants. DNA sequencing ladders were generated using 
(1) 2.14|LiM dNTP: 0.0214jiM ddNTP; (2) 2.14^iM dNTP: 0.214jaM ddNTP; or (3) 
2.14jiM dNTP: 2.14(iM ddNTP. The following purified DNA polymerases were used: 
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(A) JDF-3 #550 clone (parental) (B) Thermo Sequenase (C) JDF-3 P410L mutant (clone 
p8 ? contains ancillary mutations and an amino tag) (E) JDF-3 L408H mutant (clone 1-1). 
The top of the original sequencing gel is shown on the right hand side. 

Figure 8 shows cycle sequencing reactions performed using the JDF-3 
P410L/A485T double mutant and - 33 P Dideoxynucleotides. DNA sequencing ladders 
were generated using the JDF-3 P410L/A485T double mutant at (A) 2^1 (B) 1 jllI (C) 
0.5]ul 5 the JDF-3 P410L mutant (clone p8 5 contains ancillary mutations and an amino- 
terminal tag)(D), or Thermo Sequenase (E). The top of the original sequencing gel is 
shown on the left side. The lanes are: (bottom) ddGTP, ddATP, ddTTP, ddCTP (top). 

Figure 9 shows the result of ribonucleotide incorporation assays using exo JDF-3 
(550) and mutants of this progenitor clone. The ratios of ribonucleotide versus 
deoxynucleotide incorporation are plotted for JDF-3 550, JDF-3 L408H, JDF-3 L408F 
and JDF-3 A485T. 

Figure 10 shows the traces of the sequence generated by four versions of JDF-3 
DNA polymerase and FAM ddCTP. Panel A shows the minimal trace produced by the 
progenitor polymerase JDF-3 550, Panel B demonstrates the slightly improved trace 
made by JDF-3 P410L, Panel C shows the sequence generated by the double mutant 
S345P and P410L, and Panel D shows the trace created by JDF-3 S345P. 

Figure 1 1 shows the difference in peak uniformity demonstrated by Thermo 
Sequenase in Panel A and the double mutant JDF-3 S345P + P410L in Panel B. 

Figure 12 shows the separated products of 3' extension of a labeled 
oligonucleotide with the dideoxynucleotide thymidine triphosphate of ROX-ddUTP (New 
England Nuclear (NEN) NEL476) or Fluorescein- 12-ddUTP (NEN NEL401). Mutant 4 
is JDF-3 S345P, Mutant 2 is JDF-3 P410L, Mutant 3 is JDF-3 A485T and Mutant 5 is 
Y496N. F indicates FLU ddUTP and R indicates ROX ddUTP. 

Figure 13 shows a graphic representation of the relative band intensities form 
Figure 12. The numerical values are generated by dividing the intensity value of the 
ddTTP band into the intensity value for the Fluroescein-12-ddUTP bands. 

Figure 14 shows the sequence alignment of dye-dideoxynucleotide selected JDF-3 
mutants (amino acids 301-480). Nucleic acid residues highlighted by white boxes 
indication the location of a mutation. The mutation S345P is one of two mutations 
present in mutant 28. 
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5 Figure 15 shows the sequence alignment of dye-dideoxynucleotide selected JDF-3 

(amino acids 481-660). Nucleic acid residues highlighted by white boxes indication the 
location of a mutation. 

DESCRIPTION 

The invention is based on the discovery of Family B DNA polymerases that bear 
1 0 one or more genetic alterations resulting in reduced discrimination against non- 

conventional nucleotides relative to their unmodified wild-type forms. All references 
described herein are incorporated by reference herein in their entirety. 

Family B DNA Polymerase Exhibiting Reduced Discrimination Against Non- 
Conventional Nucleotides: 
1 5 A. DNA Polymerases Useful According to the Invention 

According to the invention, DNA polymerases of Family B may be mutated to 
generate enzymes exhibiting reduced discrimination against non-conventional 
nucleotides. Table I includes a non-limiting list of known DNA polymerases categorized 
by family. 

20 Table I. DNA POLYMERASES BY FAMILY 

FAMILY A DNA POLYMERASES 



Bacterial DNA Polymerases 
Reference 

a) E. coli DNA polymerase I (1) 
25 b) Streptococcus pneumoniae DNA polymerase I (2) 

c) Thermus aquaticus DNA polymerase I (3) 

d) Thermus flavus DNA polymerase I (4) 

e) Thermotoga maritima DNA polymerase I 
Bacteriophage DNA Polymerases 

30 a) T5 DNA polymerase (5) 

b) T7 DNA polymerase (6) 

c) Spol DNA polymerase (7) 

d) Spo2 DNA polymerase (8) 
Mitochondrial DNA polymerase 

35 Yeast Mitochondrial DNA polymerase II (9, 10, 11) 
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5 FAMILY B DNA POLYMERASES 



Bacterial DNA polymerase 

E. coli DNA polymerase II (15) 
Bacteriophage DNA polymerase 

a) PRE) 1 DNA polymerase (16,17) 

10 b) 29 DNA polymerase (18) 

c) M2 DNA polymerase ( 1 9) 

d) T4 DNA polymerase (20) 
Archaeal DNA polymerase 

a) Thermococcus litoralis DNA polymerase (Vent) (21) 

15 b) Pyrococcus furiosus DNA polymerase (22) 

c) Sulfolobus solfataricus DNA polymerase (23) 

d) Thermococcus gorgonarius DNA polymerase (64) 

e) Thermococcus species TY (65) 

f) Pyrococcus species strain KODI (66) 
20 g) Sulfolobus acidocaldarius (67) 

h) Thermococcus species 9°N-7 (68) 

i) Pyrodictium occultum (69) 
j) Methanococcus voltae (70) 
k) Desulfarococcus strain TOK (D. Tok Pol) (7 1 ) 

25 Eukaryotic Cell DNA polymerase 

(1) DNA polymerase alpha 

a) Human DNA polymerase (alpha) (24) 

b) S.cerevisiae DNA polymerase (alpha) (25) 

c) S.pombe DNA polymerase I (alpha) (26) 
30 d) Drosophila melanogaster DNA polymerase (alpha) (27) 

e) Trypanosoma brucei DNA polymerase (alpha) (28) 

(2) DNA polymerase delta 

a) Human DNA polymerase (delta) (29, 30) 

b) Bovine DNA polymerase (delta) (31) 
35 . c) S. cerevisiae DNA polymerase III (delta) (32) 

d) S. pombe DNA polymerase III (delta) (33) 

e) Plasmodiun falciparum DNA polymerase (delta) (34) 

(3) DNA polymerase epsilon 

S. cerevisiae DNA polymerase II (epsilon) (35) 
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5 (4) Other eukaryotic DNA polymerase 

S. cerevisiae DNA polymerase Rev3 (36) 
Viral DNA polymerases 

a) Herpes Simplex virus type 1 DNA polymerase (37) 

b) Equine herpes virus type 1 DNA polymerase (38) 
10 c) Varicella-Zoster virus DNA polymerase (39) 

d) Epstein-Barr virus DNA polymerase (40) 

e) Herpesvirus saimiri DNA polymerase (41) 

f) Human cytomegalovirus DNA polymerase (42) 

g) Murine cytomegalovirus DNA polymerase (43) 
1 5 h) Human herpes virus type 6 DNA polymerase (44) 

i) Channel Catfish virus DNA polymerase (45) 
j) Chlorella virus DNA polymerase (46) 
k) Fowlpox virus DNA polymerase (47) 
1) Vaccinia virus DNA polymerase (48) 

20 m) Choristoneura biennis DNA polymerase (49) 

n) Autographa California nuclear polymerase virus (AcMNPV) 

DNA polymerase (50) 

0) Lymantria dispar nuclear polyhedrosis virus DNA polymerase (51) 
p) Adenovirus-2 DNA polymerase (52) 

25 q) Adenovirus-7 DNA polymerase (53) 

r) Adenovirus- 12 DNA polymerase (54) 
Eukaryotic linear DNA plasmid encoded DNA polymerases 

a) S-l Maize DNA polymerase (55) 

b) kalilo neurospora intermedia DNA polymerase (56) 
30 c) pA12 ascobolus immersus DNA polymerase (57) 

d) pCLKl Claviceps purpurea DNA polymerase (58) 

e) maranhar neurospora crassa DNA polymerase (59) 

f) pEM Agaricus bitorquis DNA polymerase (60) 

g) pGKLl Kluyveromyces lactis DNA polymerase (61) 
35 h) pGKL2 Kluyveromyces lactis DNA polymerase (62) 

1) pSKL Saccharomyces kluyveri DNA polymerase (63) 
B. Plasmids 



The starting sequences for the generation of Family B DNA polymerases 

according to the invention may be contained in a plasmid vector. A non-limiting list of 
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5 cloned Family B DNA polymerases and their GenBank Accession numbers are listed in 

Table II. 

Table II. Accession Information for Cloned Family B 

olymerases 

Vent Thermococcus litoralis 
10 ACCESSION AAA72101 

PID g348689 

VERSION AAA72 101.1 GI:348689 
DBSOURCE locus THCVDPE accession M74198. 1 
THEST THERMOCOCCUS SP. (STRAIN TY) 
15 ACCESSION 033845 

PID g39 13524 
VERSION 033845 GE3913524 

DBSOURCE swissprot: locus DPOL THEST, accession 033845 
Pab Pyrococcus abyssi 
20 ACCESSION P77916 

PID g39 13529 
VERSION P77916 GI:39 13529 

DBSOURCE swissprot: locus DPOL PYRAB, accession P77916 
PYRHO Pyrococcus horikoshii 
25 ACCESSION 059610 

PID g39 13526 
VERSION 059610 GE3913526 

DBSOURCE swissprot: locus DP OL_P YRHO , accession 0596 1 0 
PYRSE PYROCOCCUS SP. (STRAIN GE23) 
30 ACCESSION P77932 

PID g3913530 
VERSION P77932 GI:39 13530 

DBSOURCE swissprot: locus DPOL PYRSE, accession P77932 
DeepVent Pyrococcus sp. 
35 ACCESSION AAA67131 

PID g436495 

VERSION AAA67131.1 GI:436495 
DBSOURCE locus PSU00707 accession U00707. 1 
Pfu Pyrococcus furiosus 
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5 ACCESSION P80061 

PID g399403 
VERSION P80061 GI:399403 

DBSOURCE swissprot: locus DPOLPYRFU, accession P80061 

JDF-3 Thermococcus sp. 
1 0 Unpublished 

Baross gi|2097756|pat|US|560201 1|12 Sequence 12 from patent US 560201 1 

9degN THERMOCOCCUS SP. (STRAIN 90N-7). 

ACCESSION Q56366 

PID g39 13540 
15 VERSION Q56366 GI:39 13540 

DBSOURCE swissprot: locus DPOL_THES9, accession Q56366 

KOD Pyrococcus sp. 

ACCESSION BAA06142 

PID g 16209 11 
20 VERSION BAA06142.1 GI:1620911 

DBSOURCE locus PYWKODPOL accession D29671 .1 

Tgo Thermococcus gorgonarius. 

ACCESSION 4699806 

PID g4699806 
25 VERSION GI:4699806 

DBSOURCE pdb: chain 65, release Feb 23, 1 999 

THEFM Thermococcus fumicolans 

ACCESSION P74918 

PID g3913528 
30 VERSION P74918 GI:3913528 

DBSOURCE swissprot: locus DPOL THEFM, accession 

P74918 

METTH Methanobacterium thermoautotrophicum 
ACCESSION 027276 
35 PID g3913522 

VERSION 027276 GI:39 13522 

DBSOURCE swissprot: locus DPOL METTH, accession 
027276 

Metja Methanococcus jannaschii 
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5 ACCESSION Q58295 

PID g39 15679 
VERSION Q58295 GI:39 15679 

DBSOURCE swissprot: locus DP OLMET J A, accession 
Q58295 

1 0 POC Pyrodictium occultum 

ACCESSION B56277 

PID gl363344 

VERSION B56277 GI: 1363344 

DBSOURCE pir: locus B56277 
1 5 Apel Aeropyrum pernix 

ACCESSION BAA81109 

PID g5 105797 

VERSION BAA81109.1 GI:5 105797 
DBSOURCE locus AP000063 accession AP000063. 1 
20 ARCFU Archaeoglobus fulgidus 

ACCESSION 029753 
PID g3122019 
VERSION 029753 GI:3122019 

DBSOURCE swissprot: locus DPOLARCFU, accession 

25 029753 

Desulfurococcus sp. Tok. 
ACCESSION 6435708 
PID g64357089 
VERSION GT:6435708 

30 DBSOURCE pdb. chain 65, release Jun 2, 1 999 

Plasmids acceptable for the expression of modified forms of Family B DNA 
polymerases may be selected from a large number known in the art by one of skill in the 
art. A plasmid vector for expression of a modified DNA polymerase according to the 
invention will preferably comprise sequences directing high level expression of a DNA 

35 polymerase, and will more preferably comprise sequences directing inducible, high level 

expression of a DNA polymerase. As one example of an inducible high level expression 
system, plasmids placing a modified DNA polymerase coding sequence according to the 
invention under the control of a bacteriophage T7 promoter may be introduced to bacteria 
containing an inducible T7 RNA polymerase gene within their chromosome. Induction 
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5 of the T7 RNA polymerase gene subsequently induces high level expression of the T7 

promoter-driven modified DNA polymerase gene (see for example, Gardner & Jack, 
Nucleic Acids Res. 27: 2545). 
C. Mutagenesis 

The cloned wild-type form of a Family B DNA polymerase may be mutated to 

1 0 generate forms exhibiting reduced discrimination against non-conventional nucleotides 

by a number of methods. 

First, methods of random mutagenesis which will result in a panel of mutants 
bearing one or more randomly-situated mutations exist in the art . Such a panel of 
mutants may then be screened for those exhibiting reduced discrimination relative to the 

1 5 wild-type polymerase (see "Methods of Evaluating Mutants for Reduced 

Discrimination", below). An example of a method for random mutagenesis is the so- 
called "error-prone PCR method". As the name implies, the method amplifies a given 
sequence under conditions in which the DNA polymerase does not support high fidelity 
incorporation. The conditions encouraging error-prone incorporation for different DNA 

20 polymerases vary, however one skilled in the art may determine such conditions for a 

given enzyme. A key variable for many DNA polymerases in the fidelity of 
amplification is, for example, the type and concentration of divalent metal ion in the 
buffer. The use of manganese ion and/or variation of the magnesium or manganese ion 
concentration may therefore be applied to influence the error rate of the polymerase. 

25 Second, there are a number of site-directed mutagenesis methods known in the art 

which allow one to mutate a particular site or region in a straightforward manner. There 
are a number of kits available commercially for the performance of site-directed 
mutagenesis, including both conventional and PCR-based methods. Examples include 
the EXSITE™ PCR-Based Site-directed Mutagenesis Kit available from Stratagene 

30 (Catalog No. 200502; PCR based) and the QUIKCHANGE™ Site-directed mutagenesis 

Kit from Stratagene (Catalog No. 200518; non-PCR-based), and the CHAMELEON® 
double-stranded Site-directed mutagenesis kit, also from Stratagene (Catalog No. 
200509). 

Older methods of site-directed mutagenesis known in the art relied upon sub- 
35 cloning of the sequence to be mutated into a vector, such as an Ml 3 bacteriophage 

vector, that allows the isolation of single-stranded DNA template. In these methods one 
annealed a mutagenic primer (i.e., a primer capable of annealing to the site to be mutated 
but bearing one or mismatched nucleotides at the site to be mutated) to the single- 
stranded template and then polymerized the complement of the template starting from the 
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5 3' end of the mutagenic primer. The resulting duplexes were then transformed into host 

bacteria and plaques were screened for the desired mutation. 

More recently, site-directed mutagenesis has employed PCR methodologies, 
which have the advantage of not requiring a single-stranded template. In addition, 
methods have been developed that do not require sub-cloning. Several issues must be 

1 0 considered when PCR-based site-directed mutagenesis is performed. First, in these 

methods it is desirable to reduce the number of PCR cycles to prevent expansion of 
undesired mutations introduced by the polymerase. Second, a selection must be 
employed in order to reduce the number of non-mutated parental molecules persisting in 
the reaction. Third, an extended-length PCR method is preferred in order to allow the use 

15 of a single PCR primer set. And fourth, because of the non-template-dependent terminal 

extension activity of some thermostable polymerases it is often necessary to incorporate 
an end-polishing step into the procedure prior to blunt-end ligation of the PCR-generated 
mutant product. 

The protocol described below accomodates these considerations through the 
20 following steps. First, the template concentration used is approximately 1000-fold 

higher than that used in conventional PCR reactions, allowing a reduction in the number 
of cycles from 25-30 down to 5-10 without dramatically reducing product yield. Second, 
the restriction endonuclease Dpnl 

(recognition target sequence: 5-Gm6ATC-3, where the A residue is methylated) is 
25 used to select against parental DNA, since most common strains of E. coli Dam 

methylate their DNA at the sequence 5-GATC-3. Third, Taq Extender is used in the PCR 
mix in order to increase the proportion of long (i.e., full plasmid length) PCR products. 
Finally, Pfu DNA polymerase is used to polish the ends of the PCR product prior to 
intramolecular ligation using T4 DNA ligase. 
30 The method is described in detail as follows: 

PCR-based Site Directed Mutagenesis of the 3 '-5' Exonuclease domain 
Plasmid template DNA (approximately 0.5 pmole) is added to a PCR cocktail 
containing: lx mutagenesis buffer (20 mM Tris HC1, pH 7.5; 8 mM MgC12; 40 ug/ml 
BSA); 12-20 pmole of each primer (one of skill in the art may design a mutagenic primer 
35 as necessary, giving consideration to those factors such as base composition, primer 

length and intended buffer salt concentrations that affect the annealing characteristics of 
oligonucleotide primers; one primer must contain the desired mutation, and one (the same 
or the other) must contain a 5' phosphate to facilitate later ligation), 250 uM each dNTP, 
2.5 U Taq DNA polymerase, and 2.5 U of Taq Extender (Available from Stratagene; See 
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5 Nielson et al. (1994) Strategies 7: 27, and U.S. Patent No. 5,556,772). The PCR cycling 

is performed as follows: 1 cycle of 4 min at 94°C, 2 min at 50°C and 2 min at 72°C; 
followed by 5-10 cycles of 1 min at 94°C, 2 min at 54°C and 1 min at 72°C. The parental 
template DNA and the linear, PCR-generated DNA incorporating the mutagenic primer 
are treated with Dpnl (10 U) and Pfii DNA polymerase (2.5U). This results in the Dpnl 

1 0 digestion of the in vivo methylated parental template and hybrid DNA and the removal, 

by Pfu DNA polymerase, of the non-template-directed Taq DNA polymerase-extended 
base(s) on the linear PCR product. The reaction is incubated at 37°C for 30 min and then 
transferred to 72°C for an additional 30 min. Mutagenesis buffer (1 1 5 ul of lx) 
containing 0.5 mM ATP is added to the DpnI-digested, Pfu DNA polymerase-polished 

15 PCR products. The solution is mixed and 10 ul are removed to a new micro fuge tube and 

T4 DNA ligase (2-4 U) is added. The ligation is incubated for greater than 60 min at 
37°C. Finally, the treated solution is transformed into competent E. coli according to 
standard methods. 

D. Non- Conventional Nucleotides Useful According to the Invention. 
20 There is a wide variety of non-conventional nucleotides available in the art. Any 

or all of them are contemplated for use with a DNA polymerase of the invention. A non- 
limiting list of such non-conventional nucleotides is presented in Table III. 

Table III. Non-Conventional Nucleotides 
DIDEOXYNUCLEOTIDE ANALOGS 

25 



30 



35 



Fluorescein Labeled 


Fluorophore Labeled 


Fluorescein- 1 2-ddCTP 


Eosin-6-ddCTP 


Fluorescein- 1 2-ddUTP 


Coumarin-5-ddUTP 


Fluorescein- 1 2-ddATP 


Tetramethylrhodamine-6-ddUTP 


Fluorescein- 1 2-ddGTP 


Texas Red-5-ddATP 


Fluorescein-N6-ddATP 


LISSAMINE™-rhodamine-5-ddGTP 


FAM Labeled 


TAMRA Labeled 


FAM-ddUTP 


TAMRA-ddUTP 


FAM-ddCTP 


TAMRA-ddCTP 


FAM-ddATP 


TAMRA-ddATP 


FAM-ddGTP 


TAMRA-ddGTP 


ROX Labeled 


JOE Labeled 


ROX-ddUTP 


JOE-ddUTP 


ROX-ddCTP 


JOE-ddCTP 


ROX-ddATP 


JOE-ddATP 
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ROX-ddGTP 
R6G Labeled 
R6G-ddUTP 
R6G-ddCTP 
R6G-ddATP 
R6G-ddGTP 
BIOTIN Labeled 
Biotin-N6-ATP 



TTP Analogs 

Fluorescein- 1 2-dUTP 

Coumarin-5-dUTP 

Tetramethylrhodamine-6-dUTP 

Tetraethylrhodamine-6-dUTP 

Texas Red-5-dUTP 

LISSAMINE™-rhodamine-5-dUTP 



JOE-ddGTP 

Rl 10 Labeled 
RllO-ddUTP 
RllO-ddCTP 
RllO-ddATP 
RllO-ddGTP 
DNP Labeled 
DNP-N6-ddATP 
DEOXYNUCLEOTIDE ANALOGS 
dATP-Analogs 



Naphthofluorescein-5-dUTP 
Fluorescein Chlorotriazinyl-4-dUTP 
Pyrene-8-dUTP 

Diethylaminocoumarin-5-dUTP 
Biotin-N6-dATP 
dCTP Analogs 
Coumarin-5-dCTP 
Fluorescein- 1 2-dCTP 
Tetramethylrhodamine-6-dCTP 
Texas Red-5-dCTP 
LISSAMINE™-rhodamine-5-dCTP 
Naphthofluorescein-5-dCTP 
Fluorescein Chlorotriazinyl-4-dCTP 
Pyrene-8-dCTP 
Fluorescein-N4-dCTP 
Biotin-N4-dCTP 
DNP-N4-dCTP 

RIBONUCLEOTIDE ANALOGS 
CTP Analogs UTP Analogs 
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Coumarin-5-dATP 
Diethylaminocoumarin-5-dATP 
Fluorescein- 1 2-dATP 
Fluorescein Chlorotriazinyl-4-dATP 
LISSAMINE™-rhodamine-5-dATP 
N aphthofluorescein-5 -d ATP 
Pyrene-8-dATP 

Tetramethylrhodamine-6-dATP 
Texas Red-5-dATP 
DNA-N6-dATP 



dGTP Analogs 

Coumarin-5-dGTP 
Fluorescein- 1 2-dGTP 
Tetramethylrhodamine-6-dGTP 
Texas Red-5-dGTP 
LISSAMINE™-rhodamine-5-dGTP 



Diethylaminocoumarin-5-dCTP 
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Coumarin-5-CTP 



Fluorescein- 1 2-UTP 



Fluorescein- 1 2-CTP 



Coumarin-5-UTP 



Tetrainethylrhodainine-6-CTP 



Tetramethylrhodamine-6-UTP 



Texas Red-5-CTP 



Texas Red-5-UTP 



LISS AMINE 1 M -rhodamine-5-CTP 
Naphthofluorescein-5-CTP 
Fluorescein Chlorotriazinyl-4-CTP 
Pyrene-8-CTP 



Naphthofluorescein-5-UTP 
Fluorescein Chlorotriazinyl-4-UTP 
Pyrene-8-UTP 



LISSAMINE 1M -5-UTP 



Fluorescein~N4-CTP 



Biotin-N4-CTP 



ATP Analogs 



Coumarin-5-ATP 

Fluorescein- 1 2-ATP 

Tetramethylrhodamine-6-ATP 

Texas Red-5-ATP 

LISSAMINE™-rhodamine-5-ATP 

Fluorescein-N6-ATP 

Biotin-N6-ATP 

DNP-N6-ATP 

Additional non-conventional nucleotides useful according to the invention 
include, but are not limited to 7-deaza-dATP, 7-deaza-dGTP, 5'-methyl-2'- 
deoxycytidine-5' -triphosphate. Further non-conventional nucleotides or variations on 
those listed above are discussed by Wright & Brown, 1990, Pharmacol. Ther. 47: 447. It 
is specifically noted that ribonucleotides qualify as non-conventional nucleotides, since 
ribonucleotides are not generally incorporated by DNA polymerases. Modifications of 
Family B DNA polymerases that result in the ability, or enhanced ability, of the 
polymerase to incorporate labeled or unlabeled ribonucleotides are specifically 
contemplated herein. 

E. Methods of Evaluating Mutants for Reduced Discrimination 

Random or site-directed mutants generated as known in the art or as described 
herein and expressed in bacteria may be screened for reduced discrimination against non- 
conventional nucleotides by several different assays. In one method, Family B DNA 
polymerase proteins expressed in lytic lambda phage plaques generated by infection of 
host bacteria with expression vectors based on, for example, Lambda ZapII®, are 
transferred to a membrane support. The immobilized proteins are then assayed for 
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polymerase activity on the membrane by immersing the membranes in a buffer 
containing a DNA template and the unconventional nucleotides to be monitored for 
incorporation. 

Mutant polymerase libraries may be screened using a variation of the technique 
used by Sagner et al (Sagner, G., Ruger, R., and Kessler, C. (1991) Gene 97:1 19-123). 
For this approach, lambda phage clones are plated at a density of 10-20 plaques per 
square centimeter. Proteins present in the plaques are transferred to filters and moistened 
with polymerase screening buffer (50mM Tris (pH 8.0), 7mM MgCl 2) 3mM p-ME). The 
filters are kept between layers of plastic wrap and glass while the host cell proteins are 
heat-inactivated by incubation at 65°C for 30 minutes. The heat-treated filters are then 
transferred to fresh plastic wrap and approximately 35jil of polymerase assay cocktail are 
added for every square centimeter of filter. The assay cocktail consists of IX cloned Pfu 
(cPfu) magnesium free buffer (IX buffer is 20 mM Tris-HCl (pH 8.8), 10 mM KC1, 10 
mM (NH4) 2 S0 4 , 100 ug/ml bovine serum albumin (BSA), and 0.1% Triton X-100; Pfu 
Magnesium-free buffer may be obtained from Stratagene (Catalog No. 200534)), 125 
ng/ml activated calf thymus or salmon sperm DNA, 1 .29 juCi/ml - 33 P ddNTP or 
dideoxynucleotides (at a dNTP:dye-ddNTP ratio of 1 : 1 5). Initial screening was done in 
the presence of MnCl 2 , but the preferred method was to screen in IX Taq Polymerase 
buffer (1.5 mM MgCb) The filters are placed between plastic wrap and a glass plate and 
then incubated at 65°C for one hour, and then at 70°C for one hour and fifteen minutes. 
Filters are then washed three times in 2X SSC for five minutes per wash before rinsing 
twice in 100% ethanol and vacuum drying. Filters are then exposed to X-ray film 
(approximately 16 hours), and plaques that incorporate label are identified by aligning the 
filters with the original plate bearing the phage clones. Plaques identified in this way are 
re-plated at more dilute concentrations and assayed under similar conditions to allow the 
isolation of purified plaques. 

In assays such as the one described above, the signal generated by the label is a 
direct measure of the activity of the polymerase with regard to that particular 
unconventional nucleotide or combination of unconventional nucleotides used in the 
assay. Unconventional nucleotides corresponding to all four conventional nucleotides 
may be included in the reactions, or, alternatively, only one unconventional nucleotide 
may be included to assess the effect of the mutation(s) on utilization of a given 
unconventional nucleotide. One approach is to use unconventional nucleotides 
corresponding to all four nucleotides in a first screen to identify clones that incorporate 
more than a reference wild-type clone, and then to monitor the incorporation of 
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individual unconventional nucleotides in a subsequent screen. . In the preferred screening 
mode, only the dideoxynucleotides and dideoxynucleotide analogs of ddATP, ddCTP, 
and ddTTP would be used since ddGTP is not discriminated against by some DNA 
polymerases and increases the background signal of any screen 

In order to screen for clones with enhanced ability to incorporate 
dideoxynucleotides, clones identified in first screens utilizing only dideoxynucleotides 
may then be characterized by their sensitivity to low levels of each of the four 
dideoxynucleotides in a DNA polymerase nucleotide incorporation assay employing all 
four dNTPs, a ;H-TTP tracer, and a low level of each ddNTP. Since incorporation of 
dideoxynucleotides stops DNA chain elongation, superior ability to incorporate 
dideoxynucleotides diminishes the incorporation of tritium labeled deoxynucleotides 
relative to wild-type DNA polymerase. Comparisons of ddNTP concentrations that bring 
about 50% inhibition of nucleotide incorporation (Iso%) can be used to compare ddNTP 
incorporation efficiency of different polymerases or polymerase mutants. Comparisons of 
ho% values for ddATP, ddCTP, ddGTP, and ddTTP can be used to identify mutants with 
reduced selectivity for particular bases. Such mutants would be expected to produce more 
uniform DNA sequencing ladders. 

In order to measure incorporation of individual ddNTPs, cocktails are prepared 
which consist of varying concentrations of the ddNTP of interest, and a total of 200juM of 
each nucleotide triphosphate. For example, the incorporation of ddATP by wild type 
JDF-3 polymerase may be measured at 0, 40, 80, 120 and 160|uM ddATP. In these 
reactions, dATP concentrations would be adjusted to 200, 160, 120, 80, and 40juM, 
respectively, so that the total amount of adenine nucleotide triphosphate is 200|uM. In 
comparison, mutants may be assayed using ddATP concentrations of 0, 5, 10, and 20)llM 
ddATP, and adjusted dATP concentrations of 200, 195, 190, and 180|uM, respectively 
(dATP + ddATP = 200jjM). Additional cocktails are prepared to similarly measure 
ddCTP, ddGTP, and ddTTP incorporation. 

Incorporation of nucleotides under the concentration parameters described above 
may be measured in extension reactions by adding, for example, 1 jllI of appropriately 
diluted bacterial extract (i.e., heat-treated and clarified extract of bacterial cells(see 
Example 1, part M) expressing a cloned polymerase or mutated cloned polymerase) to 10 
ul of each nucleotide cocktail, followed by incubation at 72°C for 30 minutes. Extension 
reactions are quenched on ice, and then 5jal aliquots are spotted immediately onto DE81 
ion-exchange filters (2.3cm; Whatman #3658323). Unincorporated label is removed by 6 
washes with 2xSCC (0.3M NaCl, 30mM sodium citrate, pH 7.0), followed by a brief 
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wash with 100% ethanol. Incorporated radioactivity is then measured by scintillation 
counting. Reactions that lack enzyme are also set up along with sample incubations to 
determine "total cpms" (omit filter wash steps) and "minimum cpms"(wash filters as 
above). 

Cpms bound is proportional to the amount of polymerase activity present per 
volume of bacterial extract. The volume of bacterial extract (generally about 0.25- ljji) 
which brings about incorporation of approximately 10,000 cpms is determined for use in 
subsequent nucleotide analog incorporation testing. 

Genes for mutant DNA polymerases generated by random mutagenesis may be 
sequenced to identify the sites and number of mutations. For those mutants comprising 
more than one mutation, the effect of a given mutation may be evaluated by introduction 
of the identified mutation to the exo" progenitor gene by site-directed mutagenesis in 
isolation from the other mutations borne by the particular mutant. Screening assays of 
the single mutant thus produced will then allow the determination of the effect of that 
mutation alone. 

F. Expression of Mutated Family B DNA Polymerase According to the Invention 

Methods known in the art may be applied to express and isolate the mutated forms 
of Family B DNA polymerase according to the invention. Many bacterial expression 
vectors contain sequence elements or combinations of sequence elements allowing high 
level inducible expression of the protein encoded by a foreign sequence. For example, as 
mentioned above, bacteria expressing an integrated inducible form of the T7 RNA 
polymerase gene may be transformed with an expression vector bearing a mutated DNA 
polymerase gene linked to the T7 promoter. Induction of the T7 RNA polymerase by 
addition of an appropriate inducer, for example, isopropyl-P-D-thiogalactopyranoside 
(IPTG) for a lac-inducible promoter, induces the high level expression of the mutated 
gene from the T7 promoter (see Gardner & Jack, 1999, supra). 

Appropriate host strains of bacteria may be selected from those available in the art 
by one of skill in the art. As a non-limiting example, E. coli strain BL-2 1 is commonly 
used for expression of exogenous proteins since it is protease deficient relative to other 
strains of E. coli. BL-21 strains bearing an inducible T7 RNA polymerase gene include 
WJ56 and ER2566 (Gardner & Jack, 1999, supra). For situations in which codon usage 
for the particular polymerase gene differs from that normally seen in E. coli genes, there 
are strains of BL-21 that are modified to carry tRNA genes encoding tRNAs with rarer 
anticodons (for example, argU, ileY, leuW, and proL tRNA genes), allowing high 
efficiency expression of cloned protein genes, for example, cloned archaeal enzyme 
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5 genes (several BL21-CODON PLUS cell strains carrying rare-codon tRNAs are 

available from Stratagene, for example). 

There are many methods known to those of skill in the art that are suitable for the 
purification of a modified DNA polymerase of the invention. For example, the method of 
Lawyer et al. (1993, PCR Meth. & App. 2: 275) is well suited for the isolation of 

10 thermostable DNA polymerases expressed in E. coli, as it was designed originally for the 

isolation of Taq polymerase. Alternatively, the method of Kong et al. (1993, J. Biol. 
Chem. 268: 1965, incorporated herein by reference) may be used, which employs a heat 
denaturation step to destroy host proteins, and twp column purification steps (over 
DEAE-Sepharose and heparin-Sepharose columns) to isolate highly active and 

15 approximately 80% pure thermostable DNA polymerase. Further, as detailed in Example 

1 , part N, below, DNA polymerase mutants may be isolated by an ammonium sulfate 
fractionation, followed by Q Sepharose and DNA cellulose columns, or by adsorption of 
contaminants on a HiTrap Q column, followed by gradient elution from a HiTrap heparin 
column. 

20 G. Preparation of Thermococcus species JDF-3 Thermostable DNA Polymerase With 

Reduced Discrimination 

To prepare thermostable Family B polymerases which exhibit reduced 
discrimination for dideoxynucleotide triphosphates (ddNTPs), the DNA sequence 
encoding a 3' to 5' exonuclease- deficient (D141A) Family B polymerase from the 

25 hyperthermophilic archaeon Thermococcus species JDF-3 was subjected to random 

mutagenesis using "error-prone PCR" as described herein, and cloned into the 
bacteriophage lambda Zap®IL The polymerase from JDF-3 was chosen due to superior 
processivity, polymerization rate and ddNTP incorporation relative to the Family B DNA 
polymerase from Pyrococcus furiosus (Pfu) (see Table IV, below). The library of 

30 mutants was plated on E. coli hosts and the proteins present in the lytic plaques were 

transferred to a solid support that was then immersed in a buffer containing DNA 
template and all four - 33 P labeled dideoxynucleotides. Mutants that incorporated the 
labeled dideoxynucleotide produced signals that corresponded to their ability to 
incorporate the - 33 P ddNTPs. Isolated clones were then characterized by their 

35 sensitivity to low levels of each of the four dideoxynucleotides in a DNA polymerase 

nucleotide incorporation assay employing all four dNTPs and a ;H-TTP tracer. Since 
incorporation of dideoxynucleotides stops DNA chain elongation, superior ability to 
incorporate dideoxynucleotides diminishes the incorporation of tritium labeled 
deoxynucleotides. The unmutated progenitor DNA polymerase rarely incorporates 

-38 - 



WO 01/32887 



PCT/US00/29706 



dideoxynucleotides and is only 50% inhibited at high ddNTP levels (100-160 micromolar 
each ddNTP). The mutant enzymes show 50% inhibition at 5 to 40 micromolar 
concentrations of ddNTP and improved incorporation was observed for all four ddNTPs 
(ddATP, ddCTP, ddTTP and ddGTP; see Tables V and VI in Example 1, below). 

The incorporation of non-conventional nucleotides was also evaluated through 
use of purified mutant polymerases in cycle sequencing, with - 33 P labeled ddNTPs 
present at 0.02 IjjM and dNTPs present at 2.1|uM each. The mutants readily utilized all 
four dideoxynucleotides and produced sequencing ladders that compared favorably to 
Thermo Sequenase®, which uses an F667Y Taq DNA polymerase mutant (VanderHorn 
et al., 1997, BioTechniques 22: 758). 

The domains of relevance in 1 7 of the 40 purified mutants were sequenced. Most 
randomly mutated clones contained more than one mutation in the regions sequenced but 
all mutants contained mutations at one of three sites. Mutations predicted to confer an 
enhanced ddNTP uptake phenotype were introduced into the progenitor exonuclease 
deficient DNA polymerase sequence by site-directed mutagenesis to eliminate ancillary 
mutations which were not expected to contribute to the improved dideoxynucleotide 
uptake phenotype. 

Sixteen of the seventeen JDF-3 DNA polymerase mutations were found in region 
II (motif A) on either side of the tyrosine in the consensus sequence 404 DxxSLYPSII 
413. These mutations consisted of DFRSLYLSII (P410L), DFRSHYPSII (L408H) and 
DFRSFYPSII (L408F). Therefore, the LYP motif of region II appears to be important in 
ddNTP discrimination in the JDF-3 Family B polymerase. 

The prior art modification of the tyrosine corresponding to Y409 in JDF3 Family 
B DNA polymerase is recognized for its positioning in the nucleotide binding pocket. As 
shown herein, however, modification of the residues neighboring Y409 (L408H or L408F 
or P410L) had the unexpected effect of profoundly altering nucleotide binding, 
particularly with respect to ddNTP incorporation. 

The only JDF-3 DNA polymerase mutation leading to enhanced incorporation of 
non-conventional nucleotides occurring outside of region II is an alanine (ala or A) to 
threonine (thr or T) conversion at position 485 in region III (A485T). This site is two 
residues upstream of KX3NSXYG (Jung et al., 1990, supra; Blasco et al., 1992, supra; 
Dong et al., 1993, J. Biol. Chem. 268:21 163; Zhu et al., 1994, Biochem. Biophys. Acta 
1219:260; Dong and Wang, 1995, J. Biol. Chem. 270:21563) (referred to as region III or 
motif B) which is functionally, but not structurally (Wang et al., 1997, supra), analogous 
to KX 3 (F/Y)GX 2 YG in helix O of the Family A DNA polymerases. In Family A DNA 
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5 polymerases, such as the Klenow fragment and Taq DNA polymerases, the O helix 

contains amino acids that play a major role in dNTP binding (Astatke et al., 1998, J. Mol. 
Biol. 278:147; Astatke et al., 1995, J. Biol. Chem. 270:1945; Polesky et al., 1992, J. Biol. 
Chem 267:8417; Polesky et al., 1990, J. Biol. Chem. 265:14579; Pandey et al., 1994, J. 
Biol. Chem. 269:13259; Kaushik et al., 1996, Biochem. 35:7256). Specifically, helix O 

10 contains the F (F762 in the Klenow fragment; F667 in Taq) which confers ddNTP 

discrimination in Family A DNA polymerases (KX 3 ( F/Y )GX 2 YG) (Tabor and 
Richardson, 1995, supra). 

The effect of the A485T mutation on ddNTP incorporation in the JDF-3 DNA 
polymerase is surprising since the RB69 and Thermococcus gorgonarius crystal 

15 structures (Hopfher et al., 1999, supra) show it facing away from the proposed active site 

of the nucleotide binding surface. Moreover, the type of side chain conferring ribose 
selectivity in archaeal Family B DNA polymerases (A: small, non-polar) is different from 
that of the bulky, aromatic Y and F residues that dictate ddNTP discrimination in Family 
A DNA polymerases (Tabor and Richardson, 1995, supra). Additionally, this position 

20 (A485) is not well conserved among either DNA polymerase family and is not included 

in the consensus sequence for this domain (Braithwaite and Ito, 1993, supra), implying a 
lack of critical importance in dNTP recognition. 

A JDF-3 double mutant was constructed that contains mutations P410L and 
A485T. In dideoxynucleotide cycle sequencing, the banding pattern intensity 

25 demonstrated by the double mutant was extremely uniform, suggesting little if any 

preference for any dNTP over its corresponding ddNTP (See Figure 8 and Example 1Q). 
This polymerase characteristic improves the accuracy of base calling in automated 
sequencing. We presume that combinations of P410L and A485 mutations, L408H and 
A485 mutations, and L408F and A485 mutations would result in enzymes that exhibit 

30 improved ddNTP incorporation. The efficiency of dideoxynucleotide incorporation by 

such double mutant enzymes may also be characterized or quantitated by measurement of 
the I 5 o% as described herein to determine the relative degree of improvement in 
incorporation. 

EXAMPLES 

35 The following examples are offered by way of illustration only and are by no 

means intended to limit the scope of the claimed invention. 
EXAMPLE 1. 

A. Cloning a DNA polymerase gene from Thermococcus species JDF-3 DNA 
polymerase. 
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5 A Thermococcus species was cultured from submarine samples taken from the 

Juan de Fuca ridge. Genomic DNA was isolated and used to prepare a genomic DNA 
library in ZAP II (Stratagene) using standard procedures. The lambda library was plated 
on XL 1 -Blue MRP' E. coli and screened for clones with DNA polymerase activity using 
a variation of the method described by Sagner et al. (Sagner, G., Ruger, R., and Kessler, 

10 C. (1991) Gene 97:1 19-123). Plaques containing active polymerase were cored and 

stored in SM buffer. Positive primary plaques were re-plated and re-assayed to allow 
purification of isolated clones. Secondary clones were excised according to the 
instructions provided with the ZAP II system (Stratagene), and the DNA sequence of the 
insert determined (Figure 1). 

1 5 The translated amino acid sequence of the JDF-3 DNA polymerase is shown in 

Figure 2. Amino acid sequence alignments show that JDF-3 DNA polymerase exhibits 
homology to the class of DNA polymerases referred to as Family B. 

Recombinant JDF-3 DNA polymerase was purified as described below (see 
"Purification of JDF-3" (method 1)). The biochemical properties of JDF-3 DNA 

20 polymerase have been compared to those of other commercially available archaeal DNA 

polymerases. The results shown in Table IV and V indicated that, compared to other 
enzymes, JDF-3 exhibits higher processivity, a faster polymerization rate (Kc at ), and a 
greater tendency to utilize ddNTPs. JDF-3 DNA polymerase was therefore chosen for 
development of a DNA sequencing enzyme. 

25 Table IV. Polymerase Activities of Archaeal Family B DNA Polymerases 
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5 

a H. Kong, R.B. Kucera, and W.E Jack, J. Biol. Chem . 268 , 1965 (1993). 

B. Intein removal from the gene encoding JDF-3 DNA polymerase. 

By alignment to Family B DNA polymerase sequences, the JDF-3 DNA 
polymerase clone was found to contain an intein sequence (Figures 3 and 4). To improve 

10 expression of recombinant JDF-3 polymerase, the intein was removed by inverse PCR. 

PCR primers were designed to prime immediately upstream and downstream to the 
sequence coding for the intein termini, and were oriented such that the 3 ' ends of the 
primers were pointed away from the intein. The primers were also modified with 5'- 
phosphate groups to facilitate ligation. The plasmid/insert sequence was PCR amplified 

1 5 and circularized by standard methods. 

C. Construction of a JDF-3 DNA polymerase mutant with diminished 3'-5' 
exonuclease activity. 

DNA polymerases lacking 3 '-5' exonuclease (proofreading) activity are preferred 
for applications requiring nucleotide analog incorporation (e.g., DNA sequencing) to 

20 prevent removal of nucleotide analogs after incorporation. The 3' -5' exonuclease activity 

associated with proofreading DNA polymerases can be reduced or abolished by 
mutagenesis. Sequence comparisons have identified three conserved motifs (exo I, II, III) 
in the 3 '-5' exonuclease domain of DNA polymerases (reviewed V. Derbyshire, J.K. 
Pinsonneault, and CM. Joyce, Methods Enzymol . 262 , 363 (1995)). Replacement of any 

25 of the conserved aspartic or glutamic acid residues with alanine has been shown to 

abolish the exonuclease activity of numerous DNA polymerases, including archaeal DNA 
polymerases such as Vent (H. Kong, R.B. Kucera, and W.E. Jack, J. Biol. Chem . 268 , 
1965 (1993)) and Pfu (Stratagene, unpublished). Conservative substitutions lead to 
reduced exonuclease activity, as shown for mutants of the archaeal 9° N-7 DNA 
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polymerase (M.W. Southworth, H. Kong, R.B. Kucera, J. Ware, H. Jannasch, and F.B. 
Perler, Proc. Natl. Acad. Sci . 93, 5281 (1996)). 

JDF-3 DNA polymerase mutants exhibiting substantially reduced 3 '-5' 
exonuclease activity were prepared by introducing amino acid substitutions at the 
conserved 141D or 143E residues in the exo I domain. Using the CHAMELEON® 
Double-Stranded, Site-Directed Mutagenesis Kit (Stratagene), the following JDF-3 
mutants were constructed: D141A, D141N, D141S, D141T, D141E and E143A. 

To analyze JDF-3 mutant proteins, the DNA sequence encoding JDF-3 DNA 
polymerase was PCR amplified using primers GGG AAA CAT ATG ATC CTT GAC 
GTT GAT TAC (where Ndel site in bold and start codon underlined) and GGG AAA 
GGA TCC TCA CTT CTT CTT CCC CTT C (where BamHI site shown in bold type). 
The PCR products were digested, purified, and ligated into a high expression level vector 
using standard methods. Plasmid clones were transformed into BL21(DE3). Recombinant 
bacterial clones were grown using standard procedures and JDF-3 polymerase mutants 
were expressed in the absence of induction. The exonuclease and polymerase activities of 
recombinant clones were assayed using bacterial lysates. Typically, crude extracts were 
heated at 70°C for 15-30 minutes and then centrifuged to obtain a cleared lysate. 

There are several methods of measuring 3' to 5' exonuclease activity known in 
the art, including that of Kong et al. (Kong et al., 1993, J. Biol. Chem. 268: 1965) and 
that of Southworth et al. (Southworth et al., 1996, Proc. Natl. Acad. Sci. U.S.A. 93: 
5281), the full contents of both of which are hereby incorporated by reference. The 
exonuclease activity of wild type and active mutant polymerases as measured by the 
Kong et al. method were as follows: 

Exo activity (U/mg): 

Wt 915 

D141A7 

D141N953 

D141S954 

D141T0.5 

D141E940 

E143A0.3 

The combination exonuclease mutant D141A+E143A was made as described in 
section L. 

The El 43 A JDF-3 mutant (clone #550) exhibited significantly reduced 3 '-5' exo activity 
and was chosen for further mutagenesis to improve incorporation of ddNTP and other 
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nucleotide analogs. Other JDF-3 mutants with substantially reduced exonuclease activity 
could have been used for this purpose, such as the JDF-3 D141T mutant- for experiment 
or applications requiring the absolute elimination of 3' to 5' exonuclease activity, the 
double mutant D141A + El 43 A was preferred. 

D. Error-prone PCR amplification of the JDF-3 DNA polymerase gene. 

Random mutations were introduced into exo" JDF-3 by amplifying the entire gene 
(clone #550) under conditions which did not support high fidelity replication. To broaden 
the spectrum of potential mutations, three different PCR enzymes were used under error- 
prone conditions. 

In the preferred mode, ten reactions of lOOjil each were amplified with each PCR 
enzyme. 

i. Amplifiction with Taq DNA polymerase: 
Reaction Mixture 

lx magnesium free Taq Buffer (Stratagene catalog #200530) 

ImM each TTP and dCTP 

0.2 mM each dGTP and dATP 

2ng/|ul Primer 923 (also called 490) 

2ng/jnl Primer 721 

0.05u/jal Taq2000 (Stratagene catalog #600 1 95) 

1.5mMMgCl 2 

0.5mM MnCl 2 

O.lpM plasmid DNA (clone #550) 

Cycling Parameters 

PCRs were carried out using Stratagene' s ROBOCYCLER™40 Temperature 
Cycler with a Hot Top assembly. The following cycling conditions were used: 

1) 95°C for 1 minute 

2) 95°C for 1 minute 

3) 54°C for 1 minute 

4) 72°C for 2.5 minutes 

5) Repeat steps 2 to 4 thirty times. 

ii. Amplification with exo" JDF-3 DNA polymerase 
Reaction Mixture 

lx magnesium free Taq Buffer (Stratagene catalog #200530) 

450|uMeach deoxynucleotide (dGTP, dATP, TTP and dCTP) 
2ng/|ul Primer 923 (also called 490) 
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2ng/pl Primer 721 

0. lu/)Lil exo" JDF-3 DNA polymerase 

0.5mMMnCl 2 

0. lpM plasmid DNA (clone #550) 

Cycling Parameters 

PCRs were carried out using Stratagene's ROBOCYCLER™40 Temperature 
Cycler with a Hot Top assembly. The following cycling conditions were used: 

1) 95°Cfor 1 minute 

2) 95°C for 1 minute 

3) 54°C for 1 minute 

4) 72°C for 2.5 minutes 

5) Repeat steps 2 through 4 thirty times. 

iii. Amplifiction with exo" Pfu DNA polymerase 
Reaction Mixture 

1 x TAQPLUS® Precision Buffer (Stratagene catalog #6002 1 0) 

200^Meach deoxynucleotide (dGTP, dATP, TTP, dCTP) 
2ng/jil Primer 923 (also called 490) 

2ng/|ul Primer 721 

0.05u/|li1 exo" Pfu DNA polymerase (Stratagene catalog number 6001 63) 

0.1 pM plasmid DNA (clone #550) 

Cycling Parameters 

PCRs were carried out using Perkin-Elmer's 9600 Temperature Cycler. The 
following cycling conditions were used: 

1) 95°C for 1 minute 

2) 95°C for 1 minute 

3) 53°C for 1 minute 

4) 72°C for 5 minutes 

5) Repeat steps 2 through 4 thirty times. 
Forward Primers 

Earlier versions of the mutant libraries were made with the forward primer 461, 
which contains an EcoR I site. When products amplified with primers 461 and 923 were 
restriction digested and cloned into the lambda vector as described in the following 
section, JDF-3 DNA polymerase was synthesized as a fusion protein with the first 39 
amino acids of the vector-encoded (3-galactosidase (lacZ) protein. 

Primer 46 1 5 ' TCAGATGAATTCGATG ATCCTTGACGTTGATTAC 3 ' 
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5 EcoR I JDF-3 specific sequence 

The clones isolated using primer 461 were designed as p#. 

The preferred mode of amplification and cloning utilizes the forward primer 721, 
which also contains an EcoR I site followed by three consecutive in- frame stop codons 
and a ribosome binding site. This arrangement allows the JDF-3 DNA polymerase to be 
10 translated without any vector-derived residues at the amino terminus. The clones isolated 

from libraries constructed with the forward primer 72 1 were designated as 1 -# to 
differentiate them from the p# series of clones. 

Primer 721 

5' GAGAGAATTCA TAATGATAA GGAGGAAAAAATT ATGATCCTTGACGTTGATTAC 3" 
1 5 EcoR I 3x STOP JDF-3 specific sequence 

Reverse Primers 

Primer 923 (490) 5 ' TCAGATCTCGA GTCACTTCTTCTTCCCCTTC 3 ' 
Xho I JDF-3 specific sequence 
E. Preparing PCR products for cloning. 
20 PCR products were purified and concentrated with the STRATAPREP™ PCR 

Purification kit (Stratagene catalog number 400771). The PCR products were then 
digested with 50 units of Xho I and 50 units of EcoR I in 1 .5x Universal buffer (lOx 
Universal Buffer: 1M KOAc, 250mM Tris-Acetate (pH 7.6), lOOmM MgOAc, 5mM 0- 
mercaptoethanol and 100 |ug/ml BSA) for one hour at 37°C. The digested samples were 
25 run on a 1% agarose, lx TBE gel and visualized with ethidium bromide staining. The 

2.3kb amplification product was gel isolated and purified with the STRATAPREP™ 
DNA Gel Extraction Kit (Stratagene catalog number 400766). 

F. Cloning PCR Inserts into the Uni-Zap®XR Lambda Vector. 

200ng of purified amplification product was ligated with ljug of UNI-ZAP®XR 
30 Lambda Vector (Stratagene catalog #239213), which had been predigested with EcoR I 

and Xho I and then dephosphorylated with alkaline phosphatase (Stratagene catalog 
number 23721 1). The DNAs were ligated using 2 units of T4 DNA ligase (Stratagene 
catalog number 60001 1) and 0.5mM ATP in lx ligase buffer (50mM Tris-HCL (pH 7.5), 
7mM MgCl2, ImM DTT) in reaction volumes of 10 to 15)ul. Ligations were carried out at 
35 1 6°C for a minimum of 1 6 hours. 

G. Lambda Packaging and Bacterial Infection. 

Two microliters of each ligation reaction were packaged with GIGAPACK® III 
Gold Packaging extract (Stratagene catalog #200201) for 90 minutes at room temperature 
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before being stopped with 500jil SM buffer (50mM Tris pH 7.5, lOOmM NaCl, 8mM 
MgS0 4 and 0.01% 

gelatin) and 20jlx1 of chloroform. The packaged lambda vectors were plated on E. 
coli XL 1 -Blue MRF' host cells. 
EL Dideoxynucleotide Screening. 

Mutant polymerase libraries were screened using a variation of the technique used 
by Sagner et al (Sagner, G., Ruger, R., and Kessler, C. (1991) Gene 97:1 19-123). 
Lambda phage clones were plated at a density of 10-20 plaques per square centimeter. 
Proteins present in the plaques were transferred to filters and moistened with polymerase 
screening buffer (50mM Tris (pH 8.0), 7mM MgCl 2 , 3mM p-ME). The filters were kept 
between layers of plastic wrap and glass while the host cell proteins were heat-inactivated 
by incubation at 65°C for 30 minutes. The heat-treated filters were transferred to fresh 
plastic wrap and approximately 35|il of the polymerase assay cocktail was added for 
every square centimeter of filter. Polymerase assay cocktail consisted of lx cloned Pfu 
magnesium-free buffer (Stratagene catalog #200534), 125ng/ml activated calf thymus or 
salmon sperm DNA, 1.29 juCi/ml - 33 P ddNTP (Amersham), and 0.5mM MnCl 2 . Initial 
screening was done in the presence of MnCl 2 , but the preferred method was to screen in 
IX Taq Polymerase buffer (1.5 mM MgCl2). The filters were sandwiched between 
plastic wrap and glass again and incubated at 65°C for one hour, and then at 70°C for one 
hour and 15 minutes. The filters were washed three times in 2x SSC for five minutes each 
time before being rinsed twice in 100% ethanol and dried on a vacuum dryer. The filters 
were exposed to X-ray film for approximately 16 hours. Plaques corresponding to strong 
signals were cored and placed in SM buffer. The positive primary plaques were replated 
at more dilute concentrations and assayed under essentially similar conditions to allow 
the purification of isolated plaques. 

Dye-dideoxynucleotide Screening 

To detect mutant polymerases with improved capacity for dye-deoxynucleotide 
and dye-dideoxynucleotide utilization, the JDF-3 mutant DNA polymerase library was 
screened as described previously with the following exceptions: 

Polymerase assay cocktail for Flu-12-dUTP screening: 

0.9X Taq Buffer (Stratagene Catalog #200435), 65 pM dATP, 65 \iM dCTP, 65 
\xM dGTP, 65 pMdTTP, 0.3 yM Fluoresceince-12-dUTP (Stratagene in-house 
production), 0.75 Hg/pl activated calf thymus DNA. 

Polymerase assay cocktail for ROX ddNTP 
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5 IX Taq Buffer, 0.9 juM dATP, 0.9 dCTP, 0.9 jliM dGTP, 0.9pl TTP, 0.6 jiM 

ROX ddATP (New England Nuclear (NEN) NEN478), 0.06 juM ROX ddGTP (NEN 
NEL479), 0.06 |LiM ROX ddCTP (NEN NEL477), 0.06 j^M ROX ddUTP (NEN 
NEL476), 0.84 p.g/)ul activated calf thymus DNA. (Note: A screening system without 
ROX ddGTP is the preferred method since DNA polymerases do not discriminate against 
10 ddGTP). 

Polymerase assay cocktail for Fluroesceine ddUTP 

IX Taq Buffer, 70 juM dATP, 70 |LiM dTTP, 70 jaM dCTP, 15 jliM dTTP, 1 juM 
Fluroesceine- 12-ddUTP (NEN NEL401), 0.84 \xg/\x\ activated calf thymus DNA. 
Antibody binding to fluroesceine 
15 The filters were blocked overnight with 1% non-fat dry milk dissolved in TBST 

(50 mM Tris pH 8.0, 150 mM NaCl, 0.05% Tween-20) at 4°C. The filters were washed 
briefly in TBST before alkaline phosphatase conjugated anti-fluoresceine antibody from 
the Illuminator kit (Stratagene catalog #300360) was added at a 1/10,000 dilution in 50 
ml TBST. The antibody was detected with NBT/BCIP at concentrations of 0.3 mg/ml 
20 and 0.15 mg/ml respectively in a buffer composed of 100 mM Tris pH 9.5, 100 mM 

NaCl, and 5 mM MgCl 2 . 

Antibody binding to Rhodamine 

Anti-ROX antibody (Zymed cat. no. 71-3600 rabbit Rhodamine (5-ROX 
polyclonal, 1 mg/ml)) was diluted to 1:1000 in TBST. The blocked filters were blotted 

25 briefly to remove excess moisture then laid on plastic wrap and covered with 2.5 ml of 

the diluted antibody solution. An additional sheet of plastic wrap was laid over the filters 
before incubation at room temperature for 1 hour. The filters were washed briefly three 
times with TBST, then washed three times with gentle agitation for 15 minutes each time. 
The washed filters were incubated with alkaline phosphatase conjugated goat anti-rabbit 

30 antibodies diluted 1 :5000 in TBST. The filters were incubated with the antibody for one 

hour then detected with NBT/BCIP as described previously. 
I. Dideoxynucleotide Qualification 

Lambda phage clones which incorporated 33 P-labeled ddNTPs in the 
primary library screen were re-screened to verify polymerase activity and to assess the 

35 contribution of the divalent metal ion to 33 P-ddNTP incorporation. The clones selected 

during this round of screening were designated as p#. These clones all contained an 
amino-terminal tag, as discussed in the section entitled "Forward Primers". Figure 5 
shows that clones pi, p2, p3, p6, p7, p8, p9, plO, pi 1, pl2, pl4, pl5, and pl6 exhibited 
wild type levels of DNA polymerase activity, based upon similarity in signal strength to 
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the parental #550 clone (Figure 5, panel 3). Although initial screening was carried out in 
the presence of 0.5mM MnCl 2 , all of the clones except p9 and plO were able to 
incorporate 33 P-labeled ddNTPs to at least some extent in the presence of 1 .5mM MgCl 2 
(panel 2), with clones p2, p4, p8 5 pi 1, pl2, pl3, pl4, pl5 5 pl7, and pl8 producing the 
highest signals. 

Eighteen mutants were chosen for evaluation. One microliter of phage isolated 
from each purified plaque was placed on each of three E. coli XL 1 -Blue MRF' lawns. 
Phage containing a parental copy of exo" JDF3 DNA (#550 clone) were also spotted on 
the grid. The plaques formed by the phage were transferred to filters and treated as 
described in the preceding screening section with the exception of the final buffer 
composition. The buffers used for each filter (filters 1-3) are as follows: 

Filter 1: Dideoxynucleotide screen with manganese chloride 

1 x Taq DNA polymerase magnesium-free buffer 

1.28 nCi/ml 33 P ddNTPs 

0.5 jig/|Lil Activated Calf Thymus DNA (Sigma) 
0.5 mM MnCl 2 

Filter 2: Dideoxynucleotide screen with magnesium chloride 

1 x Taq DNA polymerase buffer (containing 1 .5mM MgCl 2 , catalog #200435) 
1.28 jaCi /ml 33 P ddNTP 

0.5 jug/iLil Activated Calf Thymus DNA (Sigma) 

Filter 3 : Deoxynucleotide screen with magnesium chloride 

1 x Taq DNA polymerase buffer 
0.072 mM dGTP, dCTP and TTP 
40 \xM dATP 

0.5 jug/ml Activated Calf Thymus DNA (Sigma) 
0.01 ixCi - 33 PdATP. 
Results are shown in Figure 5. 
Dye-dideoxynucleotide Qualification 

As described in the previous segments, primary lambda clones were spotted on an 
E. coli lawn and re-screened with the appropriate antibody or antibodies. 
J. Excision of Lambda Clones. 

When incubated with helper phage under suitable conditions, Lambda Zap™ 
vectors are designed to produce phagemid copies of the part of the vector containing 
pBluescript (SK-) and the insert. This process yields a plasmid (pBluescript SK-) vector 
carrying the same insert that was contained in the lambda clone. Excision of clones with 
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the desired phenotype was carried out according to the instructions in the EXASSIST™ 
system (Stratagene catalog #200253). 
K. Sequence Analysis of Mutants. 

The mutants were sequenced by Sequetech Corporation(Mountain View, 
California) using the following primers: 

Primer 3 (or primer G) 5' CCAGCTTTCCAGACTAGTCGGCCAAGGCC 

3' 

Primer 5 (or JDF3-1 128) 5' AACTCTCGACCCGCTG 3' 
L. Dideoxynucleotide mutagenesis. 

To conclusively identify the amino acids contributing to reduced ddNTP 
discrimination, individual point mutations were introduced into the exo" JDF-3 #550 
clone using the QUIKCHANGE™ Site-Directed Mutagenesis Kit (Stratagene catalog 
#200518). The following mutants were prepared: L408H, L408F, P410L, A485T, S345P, 
D373Y, A619V, and L631V. In addition, a double mutant (P410L/A485T) was 
constructed by introducing the A485T mutation into the exo" JDF-3 P410L mutant clone. 
To completely eliminate all 3' to 5' exonuclease activity, the mutation D141 A was added 
to all clones. A pre-existing 5' to 3' exonuclease mutation (El 43 A) was present in the 
parental template JDF-3 550. 

Dye-dideoxynucleotide mutagenesis 

To conclusively identify amino acids responsible for contributing to reduced 
discrimination of dye nucleotides, the mutation S345P was generated alone and in 
combination with the P410L and P410L + A485T. 
M. Preparation of heat-treated bacterial extracts. 

E. coli SOLR cells containing the excised plasmid were grown overnight at 37°C. 
The cells contained in 500jli1 of culture were collected by microcentrifugation. The cell 
pellets were resuspended in 50pl of 50mM Tris (pH 8.0). Lysozyme was added to a final 
concentration of 1 jug/pl, and the cells were lysed during a 10 minute incubation at 37°C, 
followed by 10 minutes at 65°C. The heat-inactivated cell material was collected by 
microcentrifugation and the supernatants were assayed for dNTP and ddNTP 
incorporation as described below. 

N. Purification of JDF-3 and JDF-3 polymerase mutants. 

One method for purifying exo" JDF-3 DNA polymerase involves ammonium 
sulfate fractionation, followed by Q Sepharose and DNA cellulose columns. A second 
method has been developed to allow rapid purification of JDF-3 polymerase mutants, and 



-50- 



WO 01/32887 



PCT/US00/29706 



5 entails adsorption of contaminants on a HiTrap Q column, followed by gradient elution 

from a HiTrap heparin column (section iii). 

i. Preparation of bacterial lysate. 

Frozen cell paste (3-14 grams) was resuspended with 3x volume of lysis buffer, 
consisting of 50mM Tris-HCl (pH 8.0), 1 mM EDTA, and 10 mM p-mercaptoethanol. 

10 Lysozyme was added to 0.2mg/ml and PMSF was added to ImM final concentration. The 

cells were lysed on ice over a period of 1 hour. The lysate was then sonicated for 2 
minutes (90% duty, level of 2x2.5, 1x3.0). Following sonication, the lysate was heated at 
65°C for 15 minutes to denature bacterial proteins. The heated lysate was then 
centrifuged for 30 minutes at 14.5K rpm in a Sorvall RC-2B centrifuge using a Sorvall 

15 SS-34 rotor, and the supernatant was recovered. 

ii. Ammonium sulfate fractionation and Q Sepharose/DNA cellulose 
chromatography (method 1) 

Ammonium sulfate was added to the bacterial lysate to a final concentration of 
45%. The ammonium sulfate was added over a period of 15 minutes, and the mixture was 

20 stirred for an additional 30 minutes. The mixture was centrifuged as described above, and 

the supernatant was recovered. Additional ammonium sulfate was then added to bring the 
final concentration to 65%. The mixture was centrifuged as described above, and the 
supernatant removed. The pellet was resuspended in 10ml of buffer A consisting of 
50mM Tris-HCl (pH 7.5), 1 mM EDTA, lOmM (3-mercaptoethanol, 0.1% (v/v) Tween 

25 20, and 10% (v/v) glycerol. The supernatant was dialyzed overnight against 2 changes of 

buffer A (3 liters each). 

The dialysate was loaded onto a 2.6 x 9.4cm Q-Sepharose Fast Flow column ( 
50mls), pre-equilibrated in buffer A. The column was washed with buffer A until the 
absorbence (OD 2 8o) approached baseline. The column was then eluted with a gradient 

30 from 0 to 1M NaCl/buffer A. Fractions were collected, and analyzed by SDS-PAGE and 

DNA polymerase activity assays (see below). Active protein typically eluted between 130 
and 240mM NaCl. Active fractions were pooled and dialyzed overnight against 2 
changes of buffer B (3 liters each), consisting of 50mM Tris-HCl (pH 7.5), 1 mM EDTA, 
lOmM P-mercaptoethanol, 0.1% (v/v) Tween 20, 10%(v/v)glycerol, and 50mM NaCl. 

35 The Q-Sepharose eluate was then loaded onto a 1 .6 x 4.9cm ( lOmls) 

DNA cellulose column, equilibrated in buffer B. The column was washed with buffer B 
until the absorbence (OD 2 so) approached baseline. The column was then eluted with a 
gradient from 50 to lOOOmM NaCl/buffer A. Fractions were collected, and analyzed by 
SDS-PAGE and DNA polymerase activity assays. Active protein typically eluted 
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between 280 and 360mM NaCl. Active fractions were pooled and dialyzed overnight 
against JDF-3 final dialysis buffer, consisting of 25mM Tris-HCl (pH 7.5), lOOmM KC1, 
O.lmM EDTA, ImM DTT, 0.1% (v/v) Tween 20, 0.1% (v/v) Igepal 630, lOfxg/ml BSA, 
and 50% (v/v) glycerol. 

iii. HiTrap Q/HiTrap heparin chromatography (method 2) 

The preferable method for rapid purification of multiple mutants is as follows. 
Bacterial cell lysates were prepared as described for method 1, except that Tween 20 and 
Igepal CA 630 were added to a final concentration of 0.0 1% (v/v) just prior to the heat 
denaturation step, and a heat denaturation temperature of 72 °C was used. 

The lysate was loaded onto a 1.6 x 2.5cm ( 5mls) HiTrap Q column (pre-packed 
column from Pharmacia), pre-equilibrated in buffer C consisting of 50mM Tris-HCl (pH 
8.2), ImM EDTA, lOmM (3-mercaptoethanol, 0.1% (v/v) Tween 20, and 0.1% (v/v) 
Igepal CA 630. The column was washed with buffer C until the absorbence (OD 2 8o) 
approached baseline. The flow through fractions (OD 2 8o absorbence above background) 
were collected and then loaded onto a 1.6 x 2.5cm ( 5mls) HiTrap heparin column (pre- 
packed column from Pharmacia), pre-equilibrated in buffer D consisting of 50mM Tris- 
HCl (pH 8.2), ImM EDTA, ImM DTT, 0.1% (v/v) Tween 20, 0.1% (v/v) Igepal CA 630, 
and 10% glycerol (v/v). The column was washed with buffer D until the absorbence 
(OD 2 8o) approached baseline. The column was then eluted with a gradient from 0 to 1M 
KCl/buffer D. Fractions were collected, and analyzed by SDS-PAGE and DNA 
polymerase activity assays. Active protein typically eluted between 390 and 560mM 
NaCl. Active fractions were pooled and dialyzed overnight against JDF-3 final dialysis 
buffer (see above). Purified polymerases were stored at -20 C. 

iv. Analysis of Purified Proteins 

The concentrations of JDF-3 and mutant DNA polymerases were determined 
relative to a BSA standard (Pierce), using Pierce's Coumassie Blue Protein assay reagent. 
In addition, the purity and relative protein concentrations of different polymerase 
preparations were verified by SDS-PAGE. Polymerase samples were electrophoresed on 
4-20% Tris-glycine gels (Novex), and the gels were silver-stained using standard 
procedures. 

O. Nucleotide Incorporation Assay. 

DNA polymerase activity was measured using purified JDF-3 polymerase 
mutants or heat-treated bacterial extracts prepared from various mutant clones. DNA 
polymerase activity was measured by monitoring the incorporation of 3 H-TTP into 
activated calf thymus DNA. A typical DNA polymerase reaction cocktail contained: 
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5 10mMTris-HCl,pH8.8 
1.5mM MgCl 2 
50mM KC1 
0.001% gelatin 

200juM each dATP, dCTP, dGTP 
10 195jiMTTP 

5(iM [ 3 H]TTP (NEN #NET-221H, 20.5Ci/mmole; partially evaporated to remove 
EtOH) 

250jag/ml of activated calf thymus DNA (e.g., Pharmacia #27-4575-01) 
Incorporation was measured by adding 1 ]ul of polymerase samples to 1 0)ul 

15 aliquots of polymerase cocktail. DNA polymerase samples were diluted in a suitable 

storage buffer (e.g., 25mM Tris-HCl (pH 7.5), lOOmM KC1, O.lmM EDTA, ImM DTT, 
0.1% (v/v) Tween 20, 0.1% (v/v) Igepal 630, lOpg/ml BSA, and 50% (v/v) glycerol). 
Polymerization reactions were conducted for 30 minutes at 72°C. Extension reactions 
were quenched on ice, and then 5\x\ aliquots were spotted immediately onto DE81 ion- 

20 exchange filters (2.3cm; Whatman #3658323). Unincorporated [ 3 H]TTP was removed by 

6 washes with 2xSCC (0.3M NaCl, 30mM sodium citrate, pH 7.0), followed by a brief 
wash with 100% ethanol. Incorporated radioactivity was measured by scintillation 
counting. Reactions that lacked enzyme were also set up along with sample incubations 
to determine "total cpms" (omit filter wash steps) and "minimum cpms"(wash filters as 

25 above). 

Cpms bound is proportional to amount of polymerase activity present per volume 
of bacterial extract. The volume of bacterial extract (0.25- ljxl) which brought about 
incorporation of approximately 10,000 cpms was determined for use in subsequent 
nucleotide analog incorporation testing. 

30 P. Quantitating ddNTP incorporation efficiency. 

JDF-3 polymerase mutants were evaluated to assess relative ddNTP incorporation 
efficiency. Nucleotide incorporation was measured in the presence of varying 
concentrations of each ddNTP terminator (ddATP, ddCTP, ddGTP, and ddTTP). Since 
ddNTP incorporation produces non-extendable termini, polymerization is strongly 

35 inhibited for polymerases that incorporate ddNTPs efficiently. Comparisons of ddNTP 

concentrations that bring about 50% inhibition of nucleotide incorporation (Iso%) can be 
used to compare ddNTP incorporation efficiency of different polymerases or polymerase 
mutants. Comparisons of Iso% values for ddATP, ddCTP, ddGTP, and ddTTP can be used 
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5 to identify mutants with reduced selectivity for particular bases. Such mutants would be 

expected to produce more uniform DNA sequencing ladders. 

To measure incorporation of individual ddNTPs, cocktails were prepared which 
consisted of varying concentrations of the ddNTP of interest, and a total of 200uM of 
each nucleotide triphosphate. For example, the incorporation of ddATP by wild type 
10 JDF-3 polymerase was measured at 0, 40, 80, 120 and 160uM ddATP. In these reactions, 

dATP concentrations were adjusted to 200, 160, 120, 80, and 40uM, respectively, so that 
the total amount of adenine nucleotide triphosphate was 200uM. In comparison, mutants 
were assayed using ddATP concentrations of 0, 5, 10, and 20uM ddATP, and adjusted 
dATP concentrations of 200, 195, 190, and 180uM, respectively (dATP + ddATP = 
15 200uM). Additional cocktails were prepared to measure ddCTP, ddGTP, and ddTTP 

incorporation. To assess ddNTP incorporation by JDF-3 mutants at 3 different ddNTP 
concentrations, 12 reaction cocktails were prepared consisting of: 
lOmMTris-HCl, pH 8.8 
1.5mMMgCl 2 
20 50mM KC1 

0.001% gelatin 

5uM [ 3 H]TTP (NEN #NET-221H, 20.5Ci/mmole; partially evaporated to remove 

EtOH) 

250ug/ml of activated calf thymus DNA (e.g., Pharmacia #27-4575-01) 
25 To each of 12 reaction cocktails was added the appropriate amounts of dNTPs and 

ddNTPs as summarized below: 
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Incorporation was measured by adding ljul of appropriately diluted bacterial 

extract 

(10,000 cpms) to 10^1 of each polymerase cocktail. Polymerization reactions were 
conducted for 30 minutes at 72°C. The extension reactions were counted as described 
1 0 above. 

Reactions that lacked enzyme were also set up along with sample incubations to 
determine "minimum cpms"(wash filters as above). To determine % activity as a function 
of ddNTP concentration, background ("minimum cpms" value) was first subtracted from 
each of the sample cpms. "Total cpms", which are equivalent to 100% activity (0 
1 5 ddNTPs), are determined by averaging the corrected cpms for the 4 reactions lacking 

ddNTPs (A-0, G-0, C-0, and T-0). Percent remaining activity was then calculated by 
dividing corrected sample cpms (with ddNTPs) by the corrected total cpms (average 0 
ddNTPs). 

Percent activity was plotted as a function of ddNTP concentration. I 50 % values for 
20 each ddNTP (ddNTP concentration which inhibits nucleotide incorporation by 50%) were 

determined for each mutant. Comparisons allowed the identification of mutants with 
improved ddNTP incorporation relative to wild type JDF-3. 

Initial studies used purified enzymes, and Iso% values were determined from 
inhibition plots employing 40-160|uM ddNTPs. The results in Table V show that mutants 
25 p8 (P410L), pi 1 (P410L), and pl2(A485T) are inhibited by lower concentrations of 

ddNTPs than the parental exo" JDF-3 polymerase. Greater sensitivity indicates that the 
mutants incorporate all four ddNTPs more efficiently than the original JDF-3 polymerase. 

For enzymes which preferentially incorporate TTP over ddTTP (exo" JDF-3, exo" 
Pfu), the use of increasingly higher concentrations of ddTTP (80-160|uM) and 

30 correspondingly lower concentrations of TTP (1 15-35|uM), in combination with a 

3 

constant amount of [ H]TTP (5jjM), leads to an increase in cpms incorporated with 

increasing ddNTP concentration. Therefore, in these initial experiments (where ddTTP 
>120)uM), 150% values for TTP are artificially high. While they can be used to compare 
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ddTTP incorporation among different polymerase mutants, they can not be used to assess 
reduced/enhanced preference for ddTTP relative to ddCTP, ddGTP, or ddATP. 
Table V I 50 % Values for Purified JDF-3 and JDF-3 Mutants. 
Purified I 50% Values (uM) 

Polymerase rima 

«T 

utati 
on 

dAT dGT dCT dl 

P P P P 



60 10 160 >1 

160 160 160 >1 

410L 0 5 0 0 

410L 0 0 0 16 

485T 0 5 5 50 



Exo" JDF-3 
Exo" Pfu 
JDF-3 mutant 

p8 

JDF-3 mutant 

pll 

JDF-3 mutant 

P 12 



To allow a larger number of mutant clones to be screened, subsequent 
experiments employed bacterial extracts containing JDF-3 polymerase mutants. In 
addition, sensitivity was improved by using lower concentrations of each ddNTP 
inhibitor (5-20pM). The results in Table VI demonstrate that all of the mutants selected 
from the primary filter screen exhibited improved incorporation of ddNTPs. 
Improvements in ddNTP incorporation were as high as >20-fold. All of the mutants 
containing a mutation at amino acid 408 (L408H/F), 410 (P410L), or 485 (A485T) 
(referred to as the "primary mutation") exhibited reduced discrimination against all four 
ddNTPs. Most, but not all, mutants with the L408H/F primary mutation produced very 
similar I 50 % values (<2-fold difference) for all four ddNTPs, indicating that base 
selectivity is diminished or absent. 

Table VI. I 50 o /o Values for JDF-3 Mutants (Bacterial Extracts). 
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JDF-3 mutant 
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Exo" JDF-3 — — _____ ^ = 

80 80 80 80 

1-1, 1-4, 1-18 

408 to to 5 to 13 .5. 

H >20 10 

1-25, 1-28, 1- 

29, 1-17 408 .5 to .5 to to to 

F >20 10 6.5 



p8 



1-5, 1-6, 1-17 



410 8.5 2 .5 20 
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410 Oto .5 to 6.5 to It 

L >20 9 >20 >2 



1-41, 1-38, 1- to 
37,1-3, ot to .5 to to >2 

1-19, 1-30, 1- deter >20 12 >20 

27, 1,20 mine 

1-26, 1-32, 1- d 
16, 1-12 

Q. Sequencing with purified JDF-3 polymerase mutants. 

i. Sequencing with radioactively labeled dideoxynucleotides 
1 to 2jxl of purified enzyme was substituted into the Thermo Sequenase 
radiolabeled terminator cycle sequencing kit (Amersham-Pharmacia #US79750). The 
samples were processed according to the manufacturer's instructions using the control 
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5 primer and template provided with the kit. Three microliters of each sequencing reaction 

were loaded onto a 6% acylamide-7M urea, Ix TBE CASTAWAY™ Precast gel 
(Stratagene catalog #s 401090 and 401094). When the bromophenol blue indicator dye 
reached the end of the gel, the gel was fixed, dried and exposed to film for 24-72 hours 
(Figure 6). 

10 The results in Figure 6 show that clones pi 1 (panel D) and p8 (panel E) exhibit a 

dramatic improvement in the incorporation of all four ddNTPs compared to the parental 
#550 clone (panel B). Mutants pi 1 and p8 both contain the primary P410L mutation and 
an amino tag, but differ with respect to the number and types of ancillary mutations. 
Mutant p 12 (panel C) produced a faint sequencing ladder, presumably due to the use of 

15 an insufficient amount of enzyme or the presence of ancillary mutations which reduce 

thermal stability. There is evidence of termination products in all lanes, suggesting an 
improvement in the incorporation of all four ddNTPs relative to the parental clone. 
Mutant pl2 contains the primary mutation A485T in addition to ancillary mutations. In 
contrast to JDF-3 mutants identified here, the parental clone shows a strong preference to 

20 incorporate ddGTP, as evidenced both in primer extension (Figure 6) and ddNTP 

inhibition assays (Tables V and VI). 

ii. Sequencing with a radioactively labeled primer and fluorescent 
dideoxynucleotides 

Different DNA polymerases and polymerase mutants will exhibit varying degrees 
25 of discrimination against the dye moieties on the dideoxynucleotide analogs. An 

assessment of usage of dye-labeled dideoxynucleotide analogs by the JDF-3 polymerase 
mutants was carried out. The procedure used was as follows: 

a. Primer Labeling 

The sequencing primer SK was radioactively labeled with the KINACE-IT™ 
30 Kinasing Kit (Stratagene catalog #200390). The incubation reaction (40|il) contained the 

following components: 

IX kinase buffer #1 

0.75 fiCi/nl y- 33 P ATP 

0.375 u/ju.1 T4 polynucleotide kinase 
35 2.5 pmol/]ul SK primer 

The reaction was incubated at 37°C for 45 minutes. The primer was purified 
away from free nucleotides with a size exclusion matrix (NUC TRAP® Stratagene 
catalog number 400701). 

b. Dye labeled-dideoxynucleotide : dNTP ratios 
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Fluorescent dideoxynucleotides were purchased from New England Nuclear 



10 



15 



20 



25 



30 



35 



(NEN): 



R6G-ddATP 
RllO-ddTP 
TAMRA-ddUTP 
ROX-ddCTP 



NEN catalog number NEL-490 
NEN catalog number NEL-495 
NEN catalog number NEL-472 
NEN catalog number NEL-477 
Incorporation was measured using 3 different concentrations of dye labeled 
dideoxynucleotides (ddNTPs) and a constant amount of deoxynucleo tides 
(dNTPs; 2.14uM): 

(2-14uMeachdNTP 



Condition 3) 1 : 1 



ddNTP) 



Condition 2) 1 : 0.1 



ddNTP) 



Condition 1)1 : 0.01 



(2.14uMeachdNTP 
(2.14uMeachdNTP 



2.14uM dye-labeled 
0.214uM dye-labeled 
0.02 14uM dye-labeled 



ddNTP) 



c. Preparation of the DNA Sequencing Reaction Mixtures 

Four polymerases were tested for utilization of dye-labeled ddNTPs, exo" JDF-3 
(#550 clone), Thermo Sequenase (4u/ul), JDF-3 P410L (clone p8 with ancillary 
mutations and an amino-terminal tag) and JDF-3 L408H (clone 1-1). A mixture 
containing the following reagents was assembled: 

13.7ul H 2 0 

1 ul labeled SK primer (2 pmol/ul) 
lul pBluescript KS (0.2ug/ul) 
lul polymerase (~1.5u/ul) 

2ul 10X buffer (reaction buffer 1 for all but L408H which uses 1 .5mM MgCl 2 , 
buffer (see below) 

IPX Reaction Buffer 1 
260mM Tris pH 9.5 
65mM MgCl 2 
IPX 1.5mM MgCl 2 buffer 
24 mMMgCl 2 
26P mM Tris pH 9.5 

2.5 ul of each dye-labeled ddNTP terminator (ddGTP, ddATP, ddTTP and 
ddGTP was aliquotted separately into one of four tubes. 4.5ul of each polymerase 
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reaction was added to each of the four tubes, to give a final reaction volume of 

7fil. 

d. Cycle Sequencing Reactions 

The samples were cycled in a RoboCycler®96 Temperature Cycler with 
a Hot Top Assembly (Stratagene Catalog #400870 and #400894) using the 
following conditions: 

1) 1 minute at 95°C 

2) 1 minute at 95°C 

3) 1 minute at 50° C 

4) 2 minutes at 72° C 

5) Repeat steps 2-4 thirty times. 

4 jlxI of stop solution (95% formamide, 20mM EDTA, 0.05% bromophenol blue, 
0.05% xylene cyanol FF) was added to each of the amplified reactions before heating 
them to 99°C for five minutes. The samples were electrophoresed on a 6% 
CASTAWAY™ gel as described above. The gels were dried and then exposed to film 
for 72 hours (Figure 7). 

The results of studies designed to assess utilization of dye-labeled ddNTPs by the 
different polymerase clones are shown in Figure 7. Clones p8 (panel C) and 1-1 (panel D) 
exhibited significantly improved incorporation of R6G-ddATP and Rl 10-ddGTP, 
compared to the parental clone (panel A). Improvement was evidenced by the synthesis 
of sequencing ladders at .01x (1) and O.lx (2) dye-ddNTP/dNTP ratios. Optimization of 
reaction conditions and/or dye moieties may be performed to realize improvements in the 
incorporation of ddTTP and ddCTP. 

iii. Sequencing with double-mutant exo" JDF-3 DNA polymerase. 

To verify that changes at residues 408, 410, and 485 were sufficient to improve 
ddNTP incorporation, individual mutations were introduced into the parental 550 (JDF-3 
exo" DNA polymerase) clone by site-directed mutagenesis. In addition, point mutations 
were combined to examine whether they resulted in further improvements in 
dideoxynucleotide incorporation over polymerases bearing single mutations. 

DNA sequencing reactions consisting of lx reaction buffer, 0.15pmol/|ul long -20 
primer, and 10ng/|Lig pBluescript KS were prepared as follows: 

81jul H 2 0 

9jal -20 long primer (2pmol/^xl) 
6|ul pBluescript KS (0.2jug/]Lil) 
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5 **jul polymerase 

12]Lil 10X buffer (260mM Tris pH 9.5, 65mM MgCl 2 ) 

1 8jlx1 of the cocktail listed above was aliquotted into the appropriate number of 
tubes (one per polymerase). Each polymerase (2 ^1) was added to an aliquot of cocktail 
and the tubes were mixed well. Each resulting polymerase mixture (4.5 jul) was then 
10 added to each of four tubes, already containing 0.06mM of one of the four - 33 P- 

dideoxynucleotides (ddATP, ddTTP, ddGTP or ddTTP; 1500Ci/mmol; 450|uCi/ml) and 
6mM each deoxynucleotide in a volume of 2.5jil. 

The sequencing reactions were cycled in a ROBOCYCLER®96 temperature 
cycler with a Hot Top Assembly using the following conditions: 
15 1) 1 minute at 95 °C 

2) 45 seconds at 95°C 

3) 45 seconds at 60°C 

4) 1.5 minutes at 72 °C 

5) Repeat steps 2-4 thirty times. 

20 Stop solution (jil; 95% formamide, 20mM EDTA, 0.05% bromophenol blue, 

0.05% xylene cyanol FF) was added to each reaction before heating to 99°C for five 
minutes. Each sample (4)al) was loaded onto a 6% acrylamide denaturing CastAway gel. 
The gel was run and treated as described previously. 

Figure 8 shows that the P410L/A485T double mutant exhibits exceptionally even 

25 signals. Band uniformity was improved compared to mutant p8 (P410L mutation plus 

ancillary mutations that do not include A485T) and mutant A485T (data not shown). 
Mutant p8 exhibited a tendency to preferentially incorporate ddGTP and ddCTP in a 
sequence-dependent fashion. The optimal amount of enzyme may be higher than the 
quantity tested in this experiment. Sequence produced by the commercially available 

30 Family A DNA polymerase mutant, Thermo Sequenase, is shown in panel E. 

iv. Ribonucleotide incorporation by JDF-3 polymerase mutants. 
A primer annealed to single stranded DNA template was extended in a mixture 
containing all ribonucleotides or all deoxynucleotides with the mutant and progenitor 
polymerases. 

35 M13mpl8+ single stranded DNA was annealed to 95x molar excess of the 38mer 

primer by heating the mixture to 95 °C and cooling slowly at room temperature. 

38mer primer: 5' GGTTTTCCCAGTCACGACGTTGTAAAACGACGGCCAGT 

V 
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5 Preliminary assays were carried out to determine what dilutions of enzyme would 

be necessary to examine the incorporation activity at non-maximal levels. The final 
assay solutions were composed as described below: 
Ribonucleotide mixture 
20 ng/^il annealed primer/template 

10 1 x Cloned Pfu buffer (Stratagene catalog #200532) 

200juM each OTP, UTP, ATP 
50|uM CTP 
1 fiM 5- 3 H CTP 20.2 Ci/mmole 

0.05-0.3 units JDF-3 polymerase* 
15 Deoxyribonucleotide mixture 

20 ng/]ul annealed primer template 

lx Cloned Pfu buffer 

200|uM each dGTP, dATP, dCTP 
50jiM TTP (deoxyribonucleotide) 

20 ljuM Thymidine 5' -triphosphate, [methyl- 3 H] 20.5 

Ci/mmole 

0.05-0.3 units JDF-3 polymerase* 
*Added separately 

Nine microliters of the polymerase-free mixtures were placed in 0.2ml tubes 
25 before the polymerases were added. The samples were incubated at 72°C in a 

ROBOCYCLER®96 temperature cycler with Hot Top Assembly (Stratagene Catalog 
Nos. 400870 and 400894). The deoxyribonucleotide mixture was removed at 2 minutes 
and placed at approximately 2°C. The ribonucleotide mixture was incubated for 30 
minutes. Seven microliters of the assay mixture were spotted onto DE81 filter circles 
30 (Whatmann) and dried prior to being washed three times in 2x SSC (0.3M NaCl, 0.03M 

sodium citrate) for five minutes each wash. The filters were rinsed twice in ethanol and 
allowed to dry before being quantified with a scintillation counter. 

Background counts per minute (CPM) for the deoxyribonucleotide and the 
ribonucleotide reactions were subtracted from the respective averaged CPM value of 
35 duplicate samples for each enzyme. The background-corrected ribonucleotide CPM 

value was divided by the background-corrected deoxyribonucleotide CPM value (Figure 
9). 
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v. Ribonucleotide sequencing with JDF-3 polymerase mutants. 

Ribonucleotides incorporated into a deoxyribonucleotide polymer are susceptible 
to alkali hydrolysis which can produce a sub-population of polymer lengths. When 
labeled primer is extended in the presence of a particular ribonucleotide base (for 
example ATP) and the four deoxyribonucleotide bases, the fragments resulting from 
alkali hydrolysis create a population of different lengths, which correspond to all the 
possible positions where ATP was incorporated. When those fragments are size 
separated, their migration pattern, with respect to other ribonucleotide base (CTP, UTP 
and GTP) hydrolysis products allows the template sequence to be read. As described 
previously, most DNA polymerases discriminate against non-conventional 
deoxynucleotides. A subset of the JDF-3 DNA polymerase mutants which allow 
improved uptake of the unconventional dideoxynucleotides also show improved tolerance 
for ribonucleotide incorporation. 

lOOng of the 38mer primer was kinased with y- 33 P according to the instructions in 
the KINACE-IT™ Kinasing Kit (Stratagene catalog #300390). 

38mer primer: 5' GGTTTTCCCAGTCACGACGTTGTAAAACGACGGCCAGT 

3' 
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5 The labeled oligonucleotide was purified from contaminating free nucleotides 

with a NUC TRAP® Probe Purification Column (Statagene catalog #400701) in 10T.1E 
(lOmM Tris pH 8.0, 0.1 mM EDTA). Labeled oligonucleotide (~7 picomoles) was 
annealed to 0.09 pmoles Ml 3mpl 8+ by heating to 95°C then cooling to room 
temperature in the presence of 0.32mM MgCl 2 . 
10 Extension components 

0.054 pM annealed primer/template 
200 M each dNTP 

lx cPfu DNA polymerase buffer (Stratagene catalog #200532) 

4-200 ATP* 
15 0.1-5 Units JDF-3 polymerase* 

*Added separately 

Eight microliters of a cocktail containing the first three components listed above 
were aliquoted into a 0.2ml tube, ljil of polymerase and 1 jal of 2mM, 0.2mM or 0.4mM 
ATP were added and the reaction was incubated at 72°C for 15 minutes. The reaction 
20 volume was brought to 100)ll1 with lx cPfu polymerase buffer and transferred to a 1.5ml 

tube. After heating the reactions in the presence of 70mM NaOH for 15 minutes at 
100°C, the reaction was neutralized with 70mM HC1 and precipitated through the 
addition of lOjil 3M sodium acetate and 327.5jnl of ethanol. The samples were 
microcentrifuged for 30 minutes at 14krpm before the supernatant was removed and the 
25 pellet washed in 80% ethanol. After vacuum drying, the samples were resuspended in 

5jLtl of sequencing stop solution (95% formamide, 20mM EDTA, 0.05% bromophenol 
blue, 0.05% xylene cyanol FF) and 2.5|ul was loaded on a 6% acylamide-7M urea, lx 
TBE CASTAWAY™ Precast gel (Stratagene catalog numbers 401090 and 401094). The 
gels were run at 50 watts until the bromophenol blue dye migrated past the bottom of the 
30 gel after which the gel was fixed, dried and exposed to film for 72 hours. 

Sequencing ladders for JDF-3 550 (wild-type nucleotide incorporation) and all the 
mutants tested were visible at the 200|uM and 20jliM ATP level. At the 4jwM level, only 
the L408H and L408F mutants produced ladders (data not shown), 
vi. Sequencing with dye-dideoxynucleotide terminators 
35 Primer was extended in the presence of FAM ddCTP (NENNEL48 1). The 

sequence reactions were purified and run on an ABI 370. 

Reaction conditions for cycle-sequencing were as described below: 
IX cPFU buffer, 200 ng pBluescript II KS plasmid, 3 pmole T7 primer, 0.23 mM 
dCTP, 0.23 mM dATP, 0.23 mM dTTP, 0.23 mM dGTP with 0.046 mM FAM ddCTP. 
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The samples were cycled in a Perkin-Elmer cycler in 10 jlxI volumes for 25 cycles of the 
temperatures and times described below: 

95°C 30s 

55°C 30s 

72°C 2 min 

The samples were purified using CentriSep columns according to the 
manufacturer's instructions. After drying, the samples were resuspended in 3 jul of a 
loading dye comprised of 66.7% deionized formamide, 16.7 mg/ml Blue Dextran, and 8.3 
mM EDTA. Samples were heated at 95°C for three minutes and loaded on a 5% 
LongRangen gel in an ABI PRISM 377 DNA sequencer. 

Data was processed in Gene Scan 2. 1 . 

EXAMPLE 2 . 

Labeling of DNA. 

The modified DNA polymerases of the invention are applicable to labeling of 
DNA. It is known to those skilled in the art that there are several means by which to 
label DNA, including the incorporation of radiolabeled nucleotides. One such common 
means is by random priming, which enables one of skill in the art to generate labeled 
DNA fragments, typically about 50 to about 1000 bases long. The procedure described 
herein are adapted from F. Ausubel et al., Short Protocols in Molecular Biology, Third 
Edition, John Wiley and Sons, Inc., 1995. 

As a first step toward random priming DNA, a reaction mix containing 2.5 
microliters 0.5 mM 3dNTP (dCTP, dGTP, TTP, each at 0.5 mM), 50 nCi [ - 32 P]dATP, 1 
microliter of 3 to 8 units/microliter DNA polymerase in 50 mM Tris-HCl, pH 7.5, 10 mM 
MgCl2, 1 mM dithiothreitol, 0.05 mg/ml bovine serum albumin is prepared in a total 
volume of 1 1 microliters and incubated on ice. Next, about 30 to about 100 ng of DNA is 
mixed with about with 1 to 5 jug of random hexanucleotides in 14 microliters and boiled 
for 2 to 3 minutes and then placed on ice. The 1 1 microliter reaction mix is then added to 
the DNA/random hexamer mix, and the random priming reaction is incubated over 1 0 
minutes to as much as 4 hours at room temperature. To stop the reaction, 1 microliter 0.5 
M EDTA, 3 microliters 10 mg/ml tRNA, and 100 microliters 10 mM Tris-HCl, pH 7.4 is 
added and the mixture is extracted with phenol. The labeled DNA is then separated from 
unincorporated radioactive precursors by chromatography. 
R. Gel Assay for Dye-dideoxynucleotide Incorporation. 

A labeled oligonucleotide duplex was extended with a mixture of 
dideoxynucleotides and dye-dideoxynucleotides. When the duplex was separated on a 
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5 denaturing 20% Acrylamide/7 M urea gel, labeled oligonucleotides terminated with a 

dideoxynucleotide could be resolved from oligonucleotides terminated with dye- 
deoxynucleotides. 

Oligonucleotides : 

259C 32 P-TAACGTTGGGGGGGGGCA— ► 

10 25 8C TGCAACCCCCCCCCGTAT 

The 5' end of 259C was labeled and purified as described in Section Q.ii.a except 
that 32 Py-ATP was used. The labeled oligonucleotide 259C was at a concentration of 
approximately 0.7 ng/|ul. The complimentary oligonucleotide (258C) was added as an 
equal concentration, heated to 95°C for three minutes, 50°C for 5 minutes and room 

1 5 temperature for 20 minutes. Heat killed lysates of the relevant mutants were prepared as 

described in Example section C. The reactions were incubated in a 5 jllI volume 
composed of 30 mM Tris pH 8.0 and 3 mM MgCb with a nucleotide mixture totaling 0.1 
mM. The ratio of ddTTP to FLU ddUTP or ROXddUTP was 10:1. The dimer was 
present at a concentration of 1 .2 picomoles and 0.5 \i\ of enzyme or crude lysate or 

20 purified enzyme was added to the reaction before incubation at 50°C in the RobeCycler® 

Gradient 96 Temperature Cycler with Hot Top. The samples were incubated for 20s 
before 3 jul of a formamide based loading dye was added and the samples were heat- 
denatured at 95°C for 3 minutes then loaded onto a 20% acrylamide/7 M urea gel and 
subjected to electrophoresis at a constant 60 watts. The gel was exposed to X-ray film 

25 and the film was analyzed in the EagleEye® Eagle Sight software package. 
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7 1 . Zhao (1 999) Structure Fold Des. 7, 1 1 89. 
OTHER EMBODIMENTS 

Other embodiments will be evident to those of skill in the art. It should be 
understood that the foregoing detailed description is provided for clarity only and is 
merely exemplary. The spirit and scope of the present invention are not limited to the 
above examples, but are encompassed by the following claims. 
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CLAIMS 

1 . A purified thermostable DNA polymerase having an amino acid sequence 
presented in SEQ ID NO: 2 from residue 1 to 776. 

2. The polymerase of claim 1 that is isolated from Thermococcus species 

JDF-3. 

3. The polymerase of claim 1 that is isolated from a recombinant organism 
transformed with a vector that codes for the expression of Thermococcus species JDF-3 
DNA polymerase. 

4. A recombinant vector comprising the nucleotide sequence presented in 
SEQ ID NO: 1. 

5. An isolated recombinant polypeptide comprising the amino acid sequence 
of SEQ ID NO: 2. 

6. An isolated recombinant DNA polymerase from Thermococcus species 
JDF-3 that is 3 ' to 5 ' exonuclease deficient. 

7. The isolated recombinant DNA polymerase of claim 6, wherein said 
polymerase has a dual mutation comprising a serine to proline mutation at a site 
corresponding to S345 of SEQ ID NO: 2; and a proline to leucine mutation at a site 
corresponding to P410 of SEQ ID NO: 2. 

8. The isolated recombinant DNA polymerase of claim 6 that has an aspartic 
acid to threonine or alanine mutation at the amino acid corresponding to D 141 of SEQ ID 
NO: 2 or a glutamic acid to alanine mutation at the amino acid corresponding to El 43 of 
SEQ ID NO: 2. 

9. The isolated recombinant DNA polymerase of claim 6 that has an aspartic 
acid to threonine or alanine mutation at the amino acid corresponding to D141 of SEQ ID 
NO: 2 and a glutamic acid to alanine mutation at the amino acid corresponding to El 43 
of SEQ ID NO: 2. 

1 0. An isolated recombinant DNA polymerase having reduced 
discrimination against non-conventional nucleotides. 

11. The DNA polymerase of claim 6 or 10 that is a Family B DNA 
polymerase. 

12. The DNA polymerase of claim 6 or 10 wherein said DNA polymerase 
further comprises a mutation selected from the group consisting of: a leucine to histidine 
mutation at a site corresponding to L408 of SEQ ID NO: 2; a leucine to phenylalanine 
mutation at a site corresponding to L408 of SEQ ID NO: 2; a proline to leucine mutation 
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5 at a site corresponding to P410 of SEQ ID NO: 2; and an alanine to threonine mutation at 

a site corresponding to A485 of SEQ ID NO: 2. 

13. The DNA polymerase of claim 12 wherein said polymerase having said 
alanine to threonine mutation at said site corresponding to A485 of SEQ ID NO:2 further 
comprises a mutation selected from the group consisting of: a leucine to histidine 

10 mutation at a site corresponding to L408 of SEQ ID NO: 2; a leucine to phenylalanine 

mutation at a site corresponding to L408 of SEQ ID NO: 2; a serine to proline mutation at 
a site corresponding to S345 of SEQ ID NO: 2; and a proline to leucine mutation at a site 
corresponding to P410 of SEQ ID NO: 2. 

14. The DNA polymerase of claim 6 or 10 that has reduced discrimination 
15 against a non-conventional nucleotide selected from the group consisting of: 

dideoxynucleotides, ribonucleotides and conjugated nucleotides. 

15. The DNA polymerase of claim 14 wherein said conjugated nucleotide is 
selected from the group consisting of radiolabeled nucleotides, fluorescently labeled 
nucleotides, biotin labeled nucleotides, chemiluminescently labeled nucleotides and 

20 quantum dot labeled nucleotides. 

16. An isolated recombinant Family B DNA polymerase comprising an 
alanine to threonine mutation at the site corresponding to A485 of SEQ ID NO: 2, or a 
mutation at a site corresponding to L408, S345 or P410 of SEQ ID NO: 2, wherein said 
DNA polymerase has reduced discrimination against non-conventional nucleotides 

25 relative to the wild-type form of that polymerase. 

17. The polymerase of claim 16 that is 3' to 5' exonuclease deficient. 

18. The polymerase of claim 1 7 that has a mutation at an amino acid 
corresponding to D141 or E143 of SEQ ID NO: 2. 

19. The polymerase of claim 17 that has an aspartic acid to threonine or 
30 alanine mutation at a site corresponding to D 141 of SEQ ID NO: 2. 

20. The polymerase of claim 17 that has a glutamic acid to alanine mutation 
at a site corresponding to El 43 of SEQ ID NO: 2. 

21. The polymerase of claim 20 that has an aspartic acid to threonine or 
alanine mutation at the amino acid corresponding to D141 of SEQ ID NO: 2. 

35 22. The polymerase of claim 16 that is thermostable. 

23. The polymerase of claim 16 that is archaeal. 

24. The polymerase of claim 1 6 wherein said DNA polymerase comprises a 
leucine to histidine mutation at a site corresponding to L408 of SEQ ID NO: 2. 
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5 25. The polymerase of claim 16 wherein said DNA polymerase comprises a 

leucine to phenylalanine mutation at a site corresponding to L408 of SEQ ID NO: 2. 

26. The polymerase of claim 16 wherein said DNA polymerase comprises a 
proline to leucine mutation at a site corresponding to P410 of SEQ ID NO: 2. 

27. The polymerase of claim 16 wherein said DNA polymerase comprises a 

1 0 serine to proline mutation at a site corresponding to S345 of SEQ ID NO: 2, wherein said 

polymerase may further comprise a mutation at a site corresponding to T604 of SEQ ID 
NO: 2. 

28. The polymerase of claim 16, wherein said DNA polymerase comprises a 
tyrosine to cysteine mutation at a site corresponding to Y497 of SEQ ID NO: 2, wherein 

15 said polyerase may further comprise an isoleucine to valine mutation at a site 

corresponding to 1630 of SEQ ID NO: 2. 

29. The polymerase of claim 16, wherein said DNA polymerase comprises a 
glutamic acid to lysine mutation at a site corresponding to E645 of SEQ ID NO: 2. 

30. The polymerase of claim 16, wherein said DNA polymerase comprises a 
20 glutamic acid to lysine mutation at a site corresponding to E578 of SEQ ID NO: 2, 

wherein said polymerase may further comprise an arginine to methionine mutation at a 
site corresponding to R465 of SEQ ID NO: 2. 

31. The polymerase of claim 16, wherein said DNA polymerase comprises a 
leucine to glutamine mutation at a site corresponding to L396 of SEQ ID NO: 2, wherein 

25 said polymerase further comprises a mutation at a site corresponding to V401, N424, 

P569, E617, or V640 of SEQ ID NO: 2. 

32. The polymerase of claim 16, wherein said DNA polymerase comprises a 
serine to asparagene mutation at a site corresponding to S651 of SEQ ID NO: 2. 

33. The polymerase of claim 16, wherein said DNA polymerase comprises a 
30 leucine to proline mutation at a site corresponding to L396 of SEQ ID NO: 2, wherein 

said polymerase may further comprise a mutation at a site corresponding to E459 of SEQ 
ID NO: 2. 

34. The polymerase of claim 16, wherein said DNA polymerase comprises a 
leucine to proline mutation at a site corresponding to L456 of SEQ ID NO: 2, wherein 

35 said polymerase may further comprise a mutation at a site corresponding to E658 of SEQ 

ID NO: 2. 

35. The polymerase of claim 16, wherein said DNA polymerase comprises a 
leucine to histidine mutation at a site corresponding to L408 of SEQ ID NO: 2, wherein 
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5 said polymerase may further comprise a mutation at a site corresponding to V437, or 

L478 ofSEQ ID NO: 2. 

36. The polymerase of claim 16 wherein said DNA polymerase comprises an 
tyrosine to asparagine mutation at a site corresponding to Y496 of SEQ ID NO: 2. 

37. The polymerase of claim 16 wherein said DNA polymerase comprises an 
10 alanine to threonine mutation at a site corresponding to A485 of SEQ ID NO: 2. 

38. The polymerase of claim 37 comprising a leucine to histidine mutation at 
a site corresponding to L408 of SEQ ID NO: 2. 

39. The polymerase of claim 37 comprising a leucine to phenylalanine 
mutation at a site corresponding to L408 of SEQ ID NO: 2. 

1 5 40. The polymerase of claim 37 comprising a proline to leucine mutation at a 

site corresponding to P410 of SEQ ID NO: 2. 

41. The polymerase of claim 16, having reduced discrimination against a 

non-conventional nucleotide selected from the group consisting of: dideoxynucleotides, 

ribonucleotides and conjugated nucleotides. 
20 42. The polymerase of claim 41 wherein said conjugated nucleotide is 

selected from the group consisting of radiolabeled nucleotides, fluorescently labeled 

nucleotides, biotin labeled nucleotides, chemiluminescently labeled nucleotides and 

quantum dot labeled nucleotides. 

43. The polymerase of claim 10 or 16 further comprising a mutation at an 
25 amino acid residue in the polymerase that corresponds to a mutation selected from the 

group consisting of: a Y to V mutation at amino acid 409 of SEQ ID NO:2; an A to C, S, 

L, I, F, or V mutation at amino acid 485 of SEQ ID NO: 2; a Y to S mutation at amino 

acid 494 of SEQ ID NO: 2; a Y to L mutation at amino acid 496 of SEQ ID NO: 2; and 

an A to Y mutation at amino acid 490 of SEQ ID NO: 2. 
30 44. The polymerase of claim 1 0 or 1 6 further comprising a mutation at an 

amino acid of the polymerase corresponding to one of amino acids 483 to 496, inclusive, 

of SEQ ID NO: 2. 

45. The polymerase of claim 44 wherein said mutation is at an amino acid of 
the polymerase corresponding to one of amino acids 485, 490, 494, or 496 of SEQ ID 

35 NO: 2. 

46. An isolated recombinant Family B DNA polymerase comprising an 
alanine to threonine mutation at an amino acid corresponding to A485T of SEQ ID NO: 2 
and and at least one substitution in the polymerase of an amino acid corresponding to 
L408, Y409, S345 or P410 respectively, of SEQ ID NO: 2. 
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47. An isolated recombinant Family B DNA polymerase comprising an 
amino acid other than A at an amino acid of the polymerase corresponding to A485 of 
SEQ ID NO: 2, and at least one substitution in the polymerase of an amino acid 
corresponding to L408, Y409, S345 or P410, respectively, of SEQ ID NO: 2. 

48. A recombinant vector comprising a nucleic acid sequence encoding the 
DNA polymerase of any one of claims 1, 5, 6, 10, 16, 46 or 47. 

49. A method of labeling a complementary strand of DNA, which method 
comprises the step of: 

contacting a template DNA molecule with: 

a recombinant Family B DNA polymerase of claim 10; and 

a non-conventional nucleotide under conditions and for a time sufficient to permit 
said DNA polymerase to synthesize a complementary DNA strand and to incorporate 
said non-conventional nucleotide into said complementary DNA strand. 

50. A method of labeling a complementary strand of DNA, which method 
comprises the step of: 

contacting a template DNA molecule with: 

a recombinant Family B DNA polymerase comprising an alanine to threonine 
mutation at a site corresponding to A485 of SEQ ID NO: 2 or a mutation at a site 
corresponding to L408 or P410 or S345 of SEQ ID NO: 2, wherein said DNA polymerase 
has reduced discrimination against non-conventional nucleotides; and 

a non-conventional nucleotide under conditions and for a time sufficient to permit 
said DNA polymerase to synthesize a complementary DNA strand and to incorporate 
said non-conventional nucleotide into said complementary DNA strand. 

5 1 . The method of claim 50 wherein said recombinant Family B DNA 
polymerase is 3'-5' exonuclease deficient. 

52. The method of any one of claims 49-51 herein said recombinant Family 
B polymerase comprises a leucine to histidine mutation at a site corresponding to amino 
acid L408 of SEQ ID NO: 2. 

53. The method of any one of claims 49-51 wherein said recombinant Family 
B polymerase comprises a leucine to phenylalanine mutation at a site corresponding to 
amino acid L408 of SEQ ID NO: 2. 

54. The method of any one of claims 49-5 1 wherein said recombinant Family 
B polymerase comprises a proline to leucine mutation at a site corresponding to amino 
acid P410 of SEQ ID NO: 2. 
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5 55. The method of any one of claims 49-51 wherein said recombinant Family 

B polymerase comprises an alanine to threonine mutation at a site corresponding to 
amino acid A485 of SEQ ID NO: 2. 

56. The method of claim 55 wherein said recombinant Family B polymerase 
comprises a leucine to histidine mutation at an amino acid corresponding to L408 of SEQ 

10 ID NO: 2. 

57. The method of claim 55 wherein said recombinant Family B polymerase 
comprises a leucine to phenylalanine mutation at an amino acid corresponding to L408 of 
SEQ ID NO: 2. 

58. The method of claim 55 wherein said recombinant Family B polymerase 
1 5 comprises a proline to leucine mutation at an amino acid corresponding to P41 0 of SEQ 

ID NO: 2. 

59. The method of claim 55 wherein said recombinant Family B polymerase 
comprises a serine to proline mutation at an amino acid corresponding to S345 of SEQ ID 
NO: 2. 

20 60. The isolated recombinant DNA polymerase of claim 55, wherein said 

polymerase has a dual mutation comprising a serine to proline mutation at a site 
corresponding to S345 of SEQ ID NO: 2; and a proline to leucine mutation at a site 
corresponding to P410 of SEQ ID NO: 2. 

61. The method of any one of claims 49-60 wherein said recombinant Family 
25 B polymerase has reduced discrimination against a non-conventional nucleotide selected 

from the group consisting of: dideoxynucleotides, ribonucleotides, and conjugated 
nucleotides. 

62. The method of claim 61 wherein said conjugated nucleotide is selected 
from the group consisting of radiolabeled nucleotides, fluorescently labeled nucleotides, 

30 biotin labeled nucleotides, chemiluminescently labeled nucleotides and quantum dot 

labeled nucleotides. 

63. A method of sequencing DNA comprising the steps of: 
contacting a template DNA strand with: 

a sequencing primer; 
35 a recombinant Family B DNA polymerase of claim 9; and 

a chain-terminating nucleotide analog, under conditions that permit said DNA 
polymerase to synthesize a complementary DNA strand and to incorporate nucleotides 
into the synthesized complementary DNA strand, wherein incorporation of a chain- 
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5 terminating nucleotide analog results in the termination of chain elongation, such that the 

nucleotide sequence of said DNA strand is determined. 

64. A method of sequencing DNA comprising the steps of: 
contacting a DNA strand with: 

a sequencing primer; 

10 a recombinant Family B DNA polymerase comprising an alanine to threonine 

mutation at a site corresponding to A485 of SEQ ID NO: 2 or a mutation at a site 
corresponding to L408 or P410 or S345 of SEQ ID NO: 2, wherein said DNA polymerase 
has reduced discrimination against non-conventional nucleotides; and 

a chain-terminating nucleotide analog, under conditions that permit said DNA 

1 5 polymerase to synthesize a complementary DNA strand and to incorporate nucleotides 

into the synthesized complementary DNA strand, wherein incorporation of a chain- 
terminating nucleotide analog results in the termination of chain elongation, such that the 
nucleotide sequence of said DNA strand is determined. 

65. The method of claim 64 wherein said recombinant Family B polymerase 
20 is deficient in 3' to 5' exonuclease activity. 

66. The method of any one of claims 63-65 wherein said recombinant Family 
B polymerase has a leucine to histidine mutation at a site corresponding to amino acid 
L408 of SEQ ID NO: 2. 

67. The method of any one of claims 63-65 wherein said recombinant Family 
25 B polymerase has a leucine to phenylalanine mutation at a site corresponding to amino 

acid L408 of SEQ ID NO: 2. 

68. The method of any one of claims 63-65 wherein said recombinant Family 
B polymerase has a proline to leucine mutation at a site corresponding to amino acid 
P410ofSEQIDNO: 2. 

30 69. The method of any one of claims 63-65 wherein said recombinant Family 

B polymerase has an alanine to threonine mutation at a site corresponding to amino acid 

A485 of SEQ ID NO: 2. 

70. The method of claim 69 wherein said recombinant Family B polymerase 

has a leucine to histidine mutation at a site corresponding to L408 of SEQ ID NO: 2. 
35 71 . The method of claim 69 wherein said recombinant Family B polymerase 

has a leucine to phenylalanine mutation at a site corresponding to L408 of SEQ ID NO: 

2. 

72. The method of claim 69 wherein said recombinant Family B polymerase 
has a proline to leucine mutation at a site corresponding to P410 of SEQ ID NO: 2. 
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5 73. The method of claim 69 wherein said recombinant Family B polymerase 

has a serine to proline mutation at a site corresponding to S345 of SEQ ID NO: 2. 

74. The method of any one of claims 63-65 wherein said chain-terminating 
nucleotide analog is a dideoxynucleotide. 

75. The method of claim 74 wherein said dideoxynucleotide is detectably 
10 labeled. 

76. The method of claim 75 wherein said dideoxynucleotide is fluorescently 

labeled. 

77. The method of claim 76 wherein said dideoxynucleotide is labeled with a 
moiety selected from the group consisting of fluorescein and rhodamine. 

15 78. A kit for performing the method of claim 49, 50, 5 1 , 63, or 64. 

79. A method of making a purified thermostable DNA polymerase having an 
amino acid sequence presented in SEQ ID NO: 2 from residue 1 to 776, comprising 
culturing a host cell containing the nucleic acid sequence presented in SEQ ID NO:l 
under conditions which permit production of said DNA polymerase. 

20 80. A method of making a recombinant DNA polymerase of Thermococcus 

species JDF-3 that is 3' to 5 5 exonuclease deficient, comprising culturing a host cell 
containing a nucleic acid sequence encoding said polymerase under conditions which 
permit production of said DNA polymerase. 

81. A method of making a recombinant DNA polymerase having reduced 
25 discrimination against non-conventional nucleotides, comprising culturing a host cell 

containing a nucleic acid sequence encoding said polymerase under conditions which 
permit production of said DNA polymerase. 

82. A method of making a recombinant Family B DNA polymerase 
comprising an alanine to threonine mutation at the site corresponding to A485 of SEQ ID 

30 NO: 2 or a mutation at a site corresponding to L408 or P410 or S345 of SEQ ID NO: 2, 

wherein said DNA polymerase has reduced discrimination against non-conventional 
nucleotides relative to the wild-type form of that polymerase, comprising culturing a host 
cell containing a nucleic acid sequence encoding said polymerase under conditions which 
permit production of said DNA polymerase. 

35 83. The method of claim 79, 80, 81, or 82, wherein said host cell is E. Coli, 

84. The method of claim 79, 80, 81, or 82, wherein said host cell is 
Thermococcus. 
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FIG. J. 

JDF-3 DNA polymerase nucleotide sequence; 233 1 nucleotides (SEQ ID NO: 1) 

ATGAT CCTTGACGTTGATT ACATCAC CGAGAATGGAAAGC C CGTCATCAGGGT CTTCAAGAAGGAGAACGG 
CG AGTT C AGG ATTGAATACG ACCGCGAGTTCGAG CC CTACTTCTACG CG CTCCTCAGGGACGACTCTGC CA 
TCGAAGAAATCAAAAAGATAACCGCGGAGAGGGACGGCAGGGTCGTTAAGGTTAAGCGCGCGGAGAAGGTG 
AAGAAAAAGTTCCTCGGCAGGTCTGTGGAGGTCTGGGTCCTCTACTTCACGCACCCGCAGGACGTTCCGGC 
AATCCG CGAC AAAATAAGG AAG C AC C C CG CGGTCATCGAC ATCTACGAGTACGACAT AC C CTTCGC CAAG C 
G CT AC CTCATAGAC AAGGGCCTAAT CC CGATGG AAGGTGAGGAAG AG C TTAAACTC ATGT C C TTCGACAT C 
G AG ACG CT CT ACCACGAGGG AGAAGAGTTTGG AAC CGGG C CGATTCTG ATG ATAAG CTACG C CG AT G AAAG 
CGAGGCGCGCGTGATAACCTGGAAGAAGATCGACCTTCCTTACGTTGAGGTTGTCTCCACCGAGAAGGAGA 
TGATTAAGCGCTTCTrGAGGGTCGTTAAGGAGAAGGACCCGGACGTGCTGATAACATACAACGGCGACAAC 
TTCGACTTCGCCTACCTGAAAAAGCGCTGTGAGAAGCTTGGCGTGAGCTTTACCCTCGGGAGGGACGGGAG 
CGAGCCGAAGATACAGCGCATGGGGGACAGGTTTGCGGTCGAGGTGAAGGGCAGGGTACACTTCGACCTTT 
ATCCAGTCATAAGGCGCACCATAAACCTCCCGACCTACACCCTTGAGGCTGTATACGAGGCGGTTTTCGGC 
AAGC C C AAGGAGAAGGTCT ACG CCGAGG AGATAGC C ACCGC CTGGGAGAC CGGC GAGGGGCTTGAGAGGGT 
CGCG CGCTAC TC G ATGG AGGACGCG AGGGTT AC C TACG AGCTTGGC AGGG AGTT CT TC C CGATGGAGG C C C 
AGCTTTCCAGGCTCArCGGCCAAGGCCTCTGGGACGTTTCCCGCTCCAGCACCGGCAACCTCGTCGAGTGG 
TTCC TCC TAAGGAAGGC CTACGAGAGGAACGAACTC G CTCC CAACAAG C C CGAC GAGAGGGAGCT GGCGAG 
GAGAAGGGGGGGCTACgcCGGTGGCTACGTCAAGGAGCCGGAGCGGGGACTGTGGGACAATATCGTGTATC 
TAG ACTT TCGTAGT CTCTAC C CTT C AATCATAATCAC C CAC AACGT C T CGC CAGAT ACGCTCAAC CG CGAG 
GGGTGTAGGAGCTACGACGTTGCCCCCGAGGTCGGTCACAAGTTCTGCAAGGACTTCCCCGGCTTCATTCC 
GAG C CTG CT CGGAAACCTGCTGGAGG AAAGG CAGAAG ATAAAGAGG AAGATG AAGGCAACTCTCG AC CCGC 
TGG AGAAGAATCT C CTCG ATT AC AGGC AACG CG C CATC AAGATTCT CG CC AACAG CT ACT ACGG C TACTAC 
GGC T ATG CC AGG GCAAGATGGTAC TGC AGGG AGTGCGC CGAGAGCGTTACGGCATGGGGAAGGG AGTACAT 
CGAAATGGTCATCASAGAGCTTGAGGAAAAGTTCGGTTTTAAAGTCCTCTATGCAGACACAGACGGTCTCC 
ATG CCAC C ATT C CTGGAGCGGACG C TGAAAC AGTCAAGAAAAAGGC AATGG AGTTCTTAAACTAT AT C AAT 
CCCAAACTGCCCGGCCTTCTCGAACTCGAATACGAGGGCTTCTACGTCAGGGGCTTCTTCGTCACGAAGAA 
AAAGTAC G CGGT CATCG ACGAGG AGGGCAAGATAAC C ACG CGCGGG CTTGAGATAG' \GGCGCGACTGGA 
GCGAGATAGCGAAGGAGACGCAGGCGAGGGTTTTGGAGaCGATACTCAGGCACGGTG^CGTTGAAGAGGCC 
GT C AGAAT T GTC AGGGAAGT C AC CG AAAAG CTGAGC AAGTACGAGG TTCCG C C GG AG AAG CTGGTTATCC A 
CG AGCAG ATAACG CGCG AG C TC AAGGAC TAC AAGG C CAC C GG C C C G CACGT AGC C AT AGCG AAg C GTTTG G 
CCGCCAGAGGTGTrAAAATCCGGCCCGGAACTGTGATAAGCTACATCGTTCTGAAGGGCTCCGGAAGGATA 
GG CGAC AGGG CG ATTC C C TT C GACGAGTTC G ACC CGACG AAG CACAACJTACGATGC GGACTAC T ACATCG A 
GAACCAGGTTCTGCCGGCAGTTGAGAGAATCCTCAGGGCCTTCGGCrACCGCAAGGAAGACCTGCGCTACC 
AG AAG ACGAGG CAGGTCGGGCTT GGC GCGTGGCTGAAGCCGAAGGGG AAG AAG AAG TGA 
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FIG. 2. 

JDF-3 DNA polymerase amino acid sequence (SEQ ID NO: 2) 
Theoretical molecular weight: 90.3 kD 



MILDVDYITENGKPVIRVFKKENGEFRIEYDREFEPYFY^ 
KKKFLGRSVEVWVLYFTHPQDVPAIRDKIRKHPAVIDIYETO 
ETLYHEGEEFGTGPILMISYADESEARVITWKKIDLPYVEVVSra 
FDFAYLKKRCEKLGVSFTLGRDGSEPKIQRMGDRFAVEVKGRVHFDLYPVI 

KPKEKVYAEE I ATAWETGEGLERVARYS MEDARVTYELGREFFPMEAQLS RL I GQGLWDVSRS S TGNliVEW 
FLLRKAYERNELAPNKPDERELARRRGGYAGGYVKEPERGLWDNIVYLDFRSIiYPS 1 I ITHNVS PDTLNRE 
GCRS YDVAPEVGHKFCKDFPGFI PSLLGNLLEERQKIKRKMKATLDPLEKl^LCYRQRAI KI LANSYYGYY 
G YARARW YCRE C AE S VTAWGRE Y I EMV I RE LE EKFG FKVL YADTDGLHAT I P GADAE TVKKKAME FLNY I N 
PKLPGLLELEYEGFYTOGFFVTKKKYAVIDEEGKITTRGLEIVRRDWSEIAKETQARVLEAILRHGDVEEA 
VRIVREVTEKXiSKYEVFPEKLVIHEQITR^ 

GDRAIPFDEFDPTKHKYDADYYIENQVLPAVERILRAFGYRKEDLRYQKTRQVGLGAWLKPKGKKK 
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FIG. 3. 

JDF-3 DNA polymerase with intein sequence (SEQ ID NO: 3) 

MILDVDYITENGKPVIRVFKKENGEFRIEYDREFEPYFYALIiRDDSAIEE 

IKKITA£RHGRVVKVKRAEK\nCKKFL.GRS VEVWVLYFTHPQDVPAI RDKI 

RKHPAVIDI YEYDIPFAKRYLIDKGLI PMEGEEELKLMS FD IETLYHEGE 

EFGTGPILMISYADESEARVITWKKIDLPYVEWSTEKEMIKRFLRVVKE 

KDPDVL I TYNGDNFDFAYLKKRCEKLGVS FTLGRDGSE PKI QRMGDRFAV Extein 1 

EVKGRVHFDLYPVIRRTINLPTYTLEAVYEAVFGKPKEKVYAEEIATAWE 

TGEGLERVARYSMEDARVTYELGREFFPMEAQLSRLIGQGLWDVSRSSTG 

m^VEWFLLRKAYERNELAPNKPDERELARRRGGYAGGYVKEPERGLWDNI 

VYIiDFRSLYPS 1 1 ITHNVS PDTLNREGCRS YDVAPEVGHKFCKDFPGFI P 

S LLGNliLEERQKI KRKMKATIjDPLEKNLLD YRQRAIKI liAN 

SLLPGEWVA 

VIEGGKLRPVRIGELVDGLMEASGERVKRDGDTEVLEVEGLYASPSTGSP 

RKPAQCR * KP * * GTAMPGKFTE * liSTPEGGLSVTRGHS LFAYRDAS LWR* ' 

RGRRRFKPGDLLAVPSG*PSRRGGRGSTSLNCSSNCPRRKRPTCHRHSGK 

GRKNF FRGMLRT LRW I F GEE KTGGRP GATWS TLRGLG YVKLRKI GYGWD 

REGLGKVPRFYERLVEVIRYNGNRGEFIADFNALRPVLRLMMPEKELEEW Intein 1 

LVGTRNGFRIRPFIEVDWKFAIQjLGYYVSEGSAGKWia^RTGGWSYSVRLY 

NEDGSVIJDDMERIARSSLGA*ARGELRRDFKEDGLHNLRGALRFTGREQE 

GSVAYLHVP*GGPLGLP*GVLHRRRRRSPEQDGSAIiHQERASG*RPRPAP 

ELAGRLSDKRPPRQRGLQGLRERGTALYRVPEAEERLTYSHVIPREVLEE 

TSAGPSRRT*VTGNSGSWWKAGSSTRKGPVG*AGSSTGI*SSTGSRKSGR 

KATRGTS TT * ALRRTRTS GGLWVPLRAQX 

SYYGYYGYARARWYCRECAES 
VTAWGREYIEMVIRELEEKFGFKVLYADTDGLHATI PGADAETVKKKAME 
FliNYINPKLPGLLELEYEGFYVRGFFVTKKKYAVIDEEGKITTRGLEIVR 
RDWSEIAKETQARVLEAILRHGDVEEAVRIVREVTEKLSKYEVPPEKLVI Extein 2 
HEQITRELKDYKATGPHVAIAKRIJU^RGVKIRPGTVISYIVIiKGSGRIGD 
RAI PFDEFDPTKHKYDADYYIENQVXi PAVER ILRAFGYRKEDLRYQKTRQ 
VGLGAWLKPKGKKK 



3/17 



WO 01/32887 



PCT/US00/29706 



FIG. 4. 

JDF-3 DNA polymerase genomic sequence (SEQ ID NO: 4) 

AATTCCACTGCCGTGTTTAACCTTTCCACCGTTGAACTTGAGGGTGATTT 
TCTGAGCCTCCTCAATCACTTAATCGAGACCGCGGATTACCTTGAACTGG 
TACACGTTCAACGATTCGGTTCTTGTAATGGTC&ATACTGGGCCGTGCTG 

GATTTTCTAAACGTCTCAAGAACGGCTTTCATCAACGGAAACTGCCACGT 5* untranslated sequence 

CTCCGCCGTCGTGAGGGTTAAACCTGAAGTTCAAGACTTTGCAACGGAAT 

GGCGAGAGAACGGCGACTACCCCAGTGGAAGAGCTTTTGAAAGCCAAAGC 

CGAGCTTCAGCGAATGTGCGGTGCCCTTGTTCAAGAGTTGTGAGCCCTTG 

ATTGTTGTTTTCTCCTCTTTTCTGATAACATCGATGGCGAAGTTTATTAG 

TTCTCAGTTCGATAATCAGGCAGGTGTTGGTC 

ATGATCCTTGACGTTGAT 
TACATCACCGAGAATGGAAAGCCCGTCATCAGGGTCTTCAAGAAGGAGAA 
CGGCGAGTTCAGGATTGAATACGACCGCGAGTTCGAGCCCTACTTCTACG 
CGCTCCTCAGGGACGACTCTGCCATCGAAGAAATCAAAAAGATAACCGCG 
GAGAGGCACGGCAGGGT CGTTAAGGTTAAG CG CGCGGAGAAGGTGAAGAA 
AAAGTTCCTCGGCAGGTCTGTGGAGGTCTGGGTCCTCTACTTCACGCACC 
C G CAGGACGT TC CGG CAATC CGCGAC AAAAT AAGGAAG CAC C C CGCGGTC 
ATCGACATCTACGAGTACGACATACCCTTCGCCAAGCGCTACCTCATAGA 
CAAGGGCCTAATCCCGATGGAAGGTGAGGAAGAGCTTAAACTCATGTCCT 
TCGACATCGAGACGCTCTACCACGAGGGAGAAGAGTTTGGAACCGGGCCG 
ATTCTGATGATAAGCTACGCCGATGAAAGCGAGGCGCGCGTGATAACCTG 
GAAGAAGATCGACCTTCCTTACGTTGAGGTTGTCTCCACCGAGAAGGAGA 
TGATTAAGCGCTTCTTGAGGGTCGTTAAGGAGAAGGACCCGGACGTGCTG 
ATAACATACAACGGCGACAACTTCGACTTCGCCTACCTGAAAAAGCGCTG 
TGAGAAGCTTGGCGTGAGCTTTACCCTCGGGAGGGACGGGAGCGAGCCGA Extein i 
AGATACAGCGCATGGGGGACAGGTTTGCGGTCGAGGTGAAGGGCAGGGTA 
CACTTCGACCTTTATCCAGTCATAAGGCGCACCATAAACCTCCCGACCTA 
CACCCTTGAGGCTGTATACGAGGCGGTTTTCGGCAAGCCCAAGGAGAAGG 
TCTACGCCGAGGAGATAGCCACCGCCTGGGAGACCGGCGAGGGGCTTGAG 
AGGGTCGCGCGCTACTCGATGGAGGACGCGAGGGTTACCTACGAGCTTGG 
CAGGGAGTTCTTCCCGATGGAGGCCCAGCTTTCCAGGCTCATCGGCCAAG 
GCCTCTGGGACGTTTCCCGCTCCAGCACCGGCAACCTCGTCGAGTGGTTC 
CT C C TAAGG AAG GC CT AC G AG AGG AAC G AAC T CG CT C C CAAC AAG C C C G A 
CGAGAGGGAGCTGGCGAGGAGAAGGGGGGGCTACGCCGGTGGCTACGTCA 
AGGAGCCGGAGCGGGGACTGTGGGACAATATCGTGTATCTAGACTTTCGT 
AGTCTCTACCCTTCAATCATAATCACCCACAACGTCTCGCCAGATACGCT 
CAACCGCGAGGGGTGTAGGAGCTACGACGTTGCCCCCGAGGTCGGTCACA 
AGTTCTGCAAGGACTTCCCCGGCTTCATTCCGAGCCTGCTCGGAAACCTG 
C T GG AGG AAAGG C AGAAG ATAAAGAG G AAG ATGAAGG CAAC T CTCG ACC C 
GCTGGAGAAGAATCTCCTCGATTACAGGCAACGCGCCATCAAGATTCTCG 
CCAAC 

AGCCTTCTTCCCGGGGAGTGGGTTGCGGTCATTGAAGGGGGGAAA 
CTCAGGCCCGTCCGCATCGGCGAGCTGGTTGATGGACTGATGGAAGCCAG 
CGGGGAGAGGGTGAAAAGAGACGGCGACACCGAGGTCCTTGAAGTCGAGG 
GGCTTTACGCCTCTCCTTCGACAGGGAGTCCAAGAAAGCCCGCACAATGC 
CGGTGAAAGCCGTGATAAGGCACCGCTATGCCGGGGAAGTTTACAGAATA 
GCTCTCAACTCCGGAAGGAGGATTAAGCGTGACGCGCGGCCACAGCCTCT Intein 1 
TCGCGTACCGGGACGCGAGCTTGTGGAGGTGACGGGGGAGGAGGAGGTTC 
AAGCCCGGCGACCTCCTGGCGGTGCCAAGCGGATAACCCTCCCGGAGAGG 
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AGGGAGAGGCTCAACATCGTTGAACTGCTCCTCGAACTGCCCGAGGAGGA , 

AACGGCCGACATGTCATCGACATTCCGGCAAGGGTAGAAAGAACTTCTTC 

AGGGGAATGCTCAGAACCCTCCGCTGGATTTTCGGGGAGGAGAAGACCGG Intein 1 

AGGGCGGCCAGGCGCTACCTGGAGCACCTTGCGTGGGCTCGGCTACGTGA 

AGCTGAGGAAAATCGGCTACGGGGTGGTTGATAGGGAGGGACTGGGAAAG 

GTACCGCGCTTCTACGAGAGGCTCGTGGAGGTAATCCGCTACAACGGCAA 

CAGGGGGGAGTTCATCGCCGATTTCAACGCGCTCCGCCCCGTCCTCCGCC 

TGATGATGCCCGAGAAGGAGCTTGAAGAGTGGCTCGTTGGGACGAGGAAC 

GGGTTCAGGATAAGGCCGTTCATAGAGGTTGATTGGAAGTTCGCAAAGCT 

CCTCGGCTACTACGTGAGCGAGGGGAGCGCCGGGAAGTGGAAAAACCGGA 

CCGGGGGCTGGAGCTACTCGGTGAGGCTTTACAACGAGGACGGGAGCGTT 

CTCGACGACATGGAGAGACTCGCGAGGAGTTCTTTGGGGGCGTGAGCGCG 

GGGGGAACTACGTCGAGATTTCAAAGAAGATGGCCTACATAATCTTCGAG 

GGGCTCTGCGGTTCACCGGCCGAGAACAAGAGGGTTCCGTGGCTTATCTT 

CACGTCCCCTGAGGAGGTCCGCTGGGCCTTCCTTGAGGGGTACTTCATCG 

GCGACGGCGACGTTCACCCGAGCAAGATGGTTCGGCTCTCCACCAAGAGC 

GAGCTTCTGGCTAACGGCCTCGTCCTGCTCCTGAACTCGCTGGGCGTCTC 

AGCGATAAACGTCCGCCACGACAGCGGGGTTTACAGGGTCTACGTGAACG 

AGGAACTGCCCTTTACAGAGTACCGGAAGCGGAAGAACGCCTCACTTACT 

CCCACGTCATACCGAGGGAAGTGCTGGAGGAGACTTCGGCCGGGCCTTCC 

AGAAGAACATGAGTCACGGGAAATTCAGGGAGCTGGTGGAAAGCGGGGAG 

CTCGACGCGGAAAGGGCCGGTAGGATAGGCTGGCTCCTCGACGGGGATAT 

AGTCCTCGACAGGGTCTCGGAAGTCAGGAAGGAAAGCTACGAGGGGTACG 

TCTACGACCTGAGCGTTGAGGAGGACGAGAACTTCTGGCGGGCTTTGGGT 

TCCTCTACGCGCACAACNN 

AGCTACTACGGCTACTACGGCTATGCCAGGG 
CAAGATGGTACTGCAGGGAGTGCGCCGAGAGCGTTACGGCATGGGGAAGG 
GAG TACATCGAAATGG T CATCAG AGAGCTTGAGGAAAAGTT CGGTTTTAA 
AGT C C T C T ATGC AGACACAGAC GGTC TC CATG C C AC CATTC CTGGAGCGG 
ACG CTGAAACAGTCAAG AAAAAGG CAATGGAGTT CTTAAACTATATCAAT 
CCCAAACTGCCCGGCCTTCTCGAACTCGAATACGAGGGCTTCTACGTCAG 
GGGCTTCTTCGTCACGAAGAAAAAGTACGCGGTCATCGACGAGGAGGGCA 
AGATAACCACGCGCGGGCTTGAGATAGTCAGGCGCGACTGGAGCGAGATA 
GCGAAGGAGACGCAGGCGAGGGTTTTGGAGGCGATACTCAGGCACGGTGA Extein 2 
CGTTGAAGAGGCCGTCAGAATTGTCAGGGAAGTCACCGAAAAGCTGAGCA 
AGTACGAGGTTCCGCCGGAGAAGCTGGTTATCCACGAGCAGATAACGCGC 
GAGCTCAAGGACTACAAGGC CACCGGCC CGCACGTAGC CATAGCGAAGCG 
TTTGGCCGCCAGAGGTGTTAAAATCCGGCCCGGAACTGTGATAAGCTACA 
TCGTTCTGAAGGGCTCCGGAAGGATAGGCGACAGGGCGATTCCCTTCGAC 
GAGTTCGACCCGACGAAGCACAAGTACGATGCGGACTACTACATCGAGAA 
CCAGGTTCTGCCGGCAGTTGAGAGAATCCTCAGGGCCTTCGGCTACCGCA 
AGGAAG AC CTG CGC TACCAG AAG ACG AGGC AGGT CG GGCTTGGCG C GTGG 
CTGAAGC CGAAGGGGAAGAAG AAGTG A 

GGAATTATCTGGTTTCTTTTCCC 
AGCATTAAATGCTTCCGACATTGCCTTATTTATGAAACTCCTGTTGTGCC 
TGAGTTTGTGCCAGAAAACAGCCTGTTCTGACGGCGCTTTTTCTTGCCAG 
GTCTCTTGAGTTTCGCAAGGGTCTTCTCGACCAGCTCAATGGTCTTGTCG 
TCATTGTTTNlTONimNNNNNN^^ 

GGTAAT AGAC AG GG ATT C C TT CC TCAAGGAC TTC CCGGG AGG CATTGGAG 
TTTTTTGGTGGGGCTTTCACAGGATTTGCTCATCTTGTGGATTTCTCGTT 
CGATTGAATCTGTCCACTTGAGGGTGTAGGTCGAGACGGTGGAGCGCGTA 
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TTCCGGGAGCGGGTCTTGAGGCTCCATTTTTCAGTCCTCCTCCGGCGAAG 3' Untranslated sequence 

AAGTGGAACTCAAGCCGGGTGTTAGCTTATGTTATGTTCCCAACTCCTCC 

AGCACCTCCAGGATCCCCTCAATCCCGGAACCTCGAAGCCCCTCTCGTGG 

ATCTTTCTAACTTCCTCTGCCTCCGGGTTTATCCAGACCGCCCACATGCC 

GGCTCTCAGCGCACCCTCGAAATCCTCCGCGTAGGTGTCGCCGATGTGGA 

TTGCCTCGTCCGGCTCGACCCCGAAGCATCGAGCGGTTTTCTGAACATCT 

CGGGCATCGGCTTATACGCCAGAACCTCGTCGGCGAAGAAGGTTCCCTCA 

ATGTAGTCCATCAGGCCGAACCTCTCC5AGGGGGGGCCCGGTACCCAATTC 

GCCCTATAGTGAGTCGATTACAATTCACTGGCCGTCGTTTTACAACGTCG 

TGACTGGGAAAACCCTGGCGTTACCCAACTTAAGTCGCTTTGCAGCACAT 

CCCCC 
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Preliminary Qualification of Mutants 
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Figure 5 
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Sequencing with Purified Mutants 




Figure 6 
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Sequencing with Dye-labeled Dideoxynucleotides 
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Figure 7 
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Sequencing with the P410L, A485T Double Mutant 
and cc- 33 P Dideoxynucleotides 




Figure 8 
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6B : JDF3 S345P FAM ddCTP mix 1 5 / 



Figure 10 
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33 P - TAACGTTGGGGGGGGGCA-> 
TGCAACCCCCCCCCGTAT 
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Figure 12 
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Figure 13 
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DTLNREGCRSYDVAPEVGHKFCKDFPGFIPSIiIiGNLLEERQKIKRKMKATIiDPLEKWLLD 
DTL0REGCRSYDVAPEVGHKFCK!)FPGFIPSLLGIsnLLEE^QKIKRKMKATLDPLEKNLLD 
DTLNREGCRS YDVAPEVGHKFCKDFPGFIPSLLGNLLEERQKIKRKMKATLDPLEIGtfLLD 
DTLNREGCRS YDVAPEVGHKFCKI^FPGFIPSLLGNLLEERQKIKjjKMKATLDPLEKNLLD 
DTLNREGCRSYDVAPEVGHKFCKIJFPGFIPSIjIiGNIjIjEERQKIKRKMKATLDPLEKNLLD 
DTLNREGCRS YD VAPE VGHKF C KD F PG F I PS LLGNLLE ERQ KI KR KM KATLD P LE KNLLD 
DTLNREGCRS YDVAPEVGHKFCKDFPGFI PS LLGNLLE ERQ KI KRKMKATLDPLE KNLLD 
DTLNREGCRSY^APEVGHKFCKDFPGFIPSLLGNLLEERQKIKRKMKATLDPLEKmiLD 
DTLNREGCRSYgVAPEVGHKFCKBFPGFIPSLLGlvrLLE^RQKIKRK^KATLDPLEKNLL^ 
DTLNREGCRS YDVAPEVGHKFCKDFPGFI PS LLGNLLEERQKIKR KM KATLDPLE KNLLD 
DTLNREGCRS YDVAPEVGHKFCKDF PGFI PSLLGNQLEERQKIKRKMKATLDPLE KNLLD 
DTLNREGCRSYDVAPESGHKFCKDFPGFIPSLLGNLLEERQKIKRKMKATLDPLEKJjjgLD 
DTLNREGCRSYDVAPEVGHKFCKDFPGFIPSLLGNLLEERQKIKRKMKATLDPLEKNLLD 
DTLNREGCRS YDVAPEVGHKFCKDFPGFI PS LLGNLLE ERQKIKRKMKATLPPLEKNLLD 



Figure 14 
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Y^R^^SlT^^G^GYARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 
YRQRAIKILANSYYGYYGYARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 
YRQRAI KILANS YYG YYGYARARWYCRECAESVTAWGREY I EMVI RELEEKFGFKVLYAD 
YRQRAI KILANSYYGCTGYARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 
YRQRAIKILANjSYYGYYGYARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 
YRQRAI KILANS YYG YYG YARARWYCRECAESVTAWGREY I EMVI RELEEKFGFKVLYAD 
YRQRAIKILANSYYGYYGYARARWYCRECAESVTAWGREY I EMVI RELEEKFGFKVLYAD 
YRQRAI KILANS YYG YYGYARARWYCRECAESVTAWGREY I EMVI RELEEKFGFKVLYAD 
YRQRA I KI L ANS YYG YYG YARARWYCRECAESVTAWGREY I EMVIRELEEKFGFKVLY AD 
YRQRAIKILANSYYGYYGYARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 
YRQRAI KILANS YYG YYG YARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 
YRQRAIKI LANS YYGYYG YARARWYCRECAESVTAWGREY IEMVIRELEEKFGFKVLYAD 
YRQRAIKILANSYYGgjYGYARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 
YRQRAIKILANSYYGYYGYARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 



tdglhatipga^ 

TDGLHATIPGADAETVKKKAMEFLNYINPKLPGLLELEYEGFYVRGFFVTKKKYAVIDEE 
TDGLHATIPGADAETVKKKAMEFLNYINPKLPGLLELEYEGFYVRGFFVTKKKYAVIDEE 
TDGLHATIPGADAETVKKKAMEFL^INPKLPGLLELgYEGFYVRGFFVTKKKYAVIDEE 
TDGLHATIPGADAETVKKKAMEFLNYINPKLPGLLELEYEGFYVRGFFVTKKKYAVIDEE 

tdglhatipgadaetvkkkameflistyinWklpglleleyegfyvrgffvtkkk^avidee 
tdglhatipgadaetvkkkameflnyinpklpglleleyegfyvrgffvtkkkyavidee 
tdglhatipgadaetvkjocameflnyinpklpglleleyegfyvrgffvtkkkyavidee 
tdglhati pgadaetvkkkameflnyi npklpgllele yegfyvrgffvtkkkyavidee 
tdglhatipgadaetvkkkameflnyinpklpgllegeyegfyvrgffvtkkkyavidee 
tdglhatipgadaetvkkkameflnyinpklpglleleyegfyvrgffvtkkkyavidee 
tdglhatipgadaetvkxkameflnyinpklpglleleyegfyvrgffvtkkkyavidee 
tdglhat i pg adaetvkkkameflnyinpklpgllele yeg f yvrg f fvt kkkyavide e 
tdglhatipgadaetvkkkameflnyinpklpglleleyegfyvrgffvtkkkyavidee 



GKITTRGLEIVRRDWSEIAKETQARVLEAVJLRHGDVEEAVRIVREVTEKLSKYEVPPEKL 
GKITTRGLEIVRRDWSEIAKETQARVLEAILRHGDVEEAVRIVREVTEKLSKYEVPPEjgL 
GKITTRGLEIVRRDWSEIAKETQARVLEAILRHGDVEEAVRIVRjgVTEKLSKYEVPPEKL 
GKITTRGLEIVRRDWSEIAKETQARVLEAILRHGDVEEAVRIVREVTEKLSKYEVPPEKL 
GKITTRGLEIVRRDWSEIAKETQARVLEAILRHgDVEEAVRIVREVTEKLSKYEVPPEKL 
GKITTRGLEIVRRDWSglAKETQARVLEAILRHGDVEEAgRIVREVTEKLSKYEVPPEKL 
GKlgTRGLEIVRRDWSEIAKETQARVLEAILRHGDVEEAVRIVREVTEKLSKYEVPPEKL 
GKITTRGLEIVRRDWSEIAKETQARVLEAILRHGDVEEAVRIVREVTEKLjgjKYEVPPEKL 
GKITTRGLEIVRRDWSEIAKETQARVLEAILRHGDVEEAVRIVREVTEKLSKYEVPPEKL 
GKITTRGLEIVRRDWSEIAKETQARVLEAILRHGDVEEAVRIVREVTEKLSKYEVPPEKL 
GKITTRGLEIVRRDWSEIAKETQARVLEAILRHGDVEEAVRIVREVTEKLSKYEVPP ffiKL 
GKITTRGLEIWRDWSEIAKETQARVLEAILRHGDVEEAWIVREVTEKLSKYEVPPggg 
GKITTRGLEIVRRDWSEIAKETQARVLEAILRHGDVEEAVRIVREVTEKLSKYEVPPEKL 
GKITTRGLE I VRRDWS E I AKETQ ARVLEA I LRHGD VE E AVR I VR E VTE KL S K Y E V P PEKL| 



4 140 

10 141 

13 141 

16 141 

18 141 

19 141 
28 141 
34 141 
41 141 
33 141 
48 141 
55 142 
64 144 
Jdf3 481 



4 200 

10 201 

13 2 01 

16 201 

18 201 

19 201 
28 201 
34 201 
41 201 
33 201 
48 201 
55 202 
64 204 
Jd£3 541 



4 


260 


10 


261 


13 


261 


16 


261 


13 


261 


19 


261 


23 


261 


34 


261 


41 


261 


33 


261 


48 


261 


55 


262 


64 


264 


Jdf3 


601 



Figure 15 



17/17 



INTERNATIONAL SEARCH REPORT 



Int ial Application No 

PCT/US 00/29706 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 7 C12N15/54 C12N15/70 



C12N9/12 



C12Q1/68 



C12P19/34 



According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 7 C12N 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 

WPI Data, PAJ, CAB Data, STRAND, BIOSIS, EPO-Internal 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ° Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



WO 99 06538 A (AMERSHAM PHARM BIOTECH INC) 
11 February 1999 (1999-02-11) 



the whole document 

W0 97 39150 A (UNIV ALBERTA ;UNIV SOUTHERN 

CALIFORNIA (US)) 

23 October 1997 (1997-10-23) 



the whole document 



10,11, 
14,48, 
49,61, 
78,81, 
83,84 



10,11, 
14,48, 
49,61, 
78,81, 
83,84 



-/-- 



LH 



Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



° Special categories of cited documents : 

"A" document defining the general state of the art which is not 

considered to be of particular relevance 
"E" earlier document but published on or after the international 

filing date 

"L" document which may throw doubts on priority claim (s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use, exhibition or 
other means 

■P' document published prior to the international filing date but 
later than the priority date claimed 



•T" later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

'X' document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

■V document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

document member of the same patent family 



Date of the actual completion of the international search 



16 March 2001 



Date of mailing of the international search report 



29/03/2001 



Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL - 2280 HV Rijswijk 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: (+31-70) 340-3016 



Authorized officer 



Hornig, H 



Form PCT/ISA/210 (second sheet) (July 1992) 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 



In I Application No 



PCT/US 00/29706 



C.(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 


Category ° 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. i 


X 


EP 0 727 496 A (HARVARD COLLEGE) 
21 August 1996 (1996-08-21) 

the whole document 


10,11, 
14,48, 

49,61, ! 

78,81, 

83,84 


X 


EP 0 655 506 A (HARVARD COLLEGE) 
31 May 1995 (1995-05-31) 

the whole document 


10,11, 
14,48, 
49,61, 
78,81, 
83,84 


X 


GARDNER ANDREW F ET AL: "Determinants of 
nucleotide sugar recognition in an 
archaeon DNA polymerase." 
NUCLEIC ACIDS RESEARCH, 
vol. 27, no. 12, 

15 June 1999 (1999-06-15), pages 

2545-2553, XP002163101 

ISSN: 0305-1048 

cited in the application 

the whole document 


10,11, 
14,48, 
49,61, 
78,81, 
83,84 


A 


EP 0 745 676 A (AMERSHAM LIFE SCIENCE INC) 
4 December 1996 (1996-12-04) 
the whole document 




A 


W0 95 04162 A (UNIV ALBERTA ;UNIV SOUTHERN 

CALIFORNIA (US)) 

9 February 1995 (1995-02-09) 

the whole document 





Form PCT/ISA/210 (continuation of second sheet) (July 1992) 



page 2 of 2 



Information on patent family members 


In al Application No 

PCT/US 00/29706 


Patent document 
cited in search report 


Publication 
date 


Patent family 
member(s) 


Publication 
date 



WO 9906538 



11-02-1999 



US 
AU 
EP 



5882904 A 
8767198 A 
1002057 A 



16-03-1999 
22-02-1999 
24-05-2000 



W0 9739150 



23-10-1997 



AU 
CA 
EP 
US 



2804597 A 
2251643 A 
0910664 A 
5945312 A 



07-11-1997 
23-10-1997 
28-04-1999 
31-08-1999 



FP 




A 

r\ 


?1 - 

C— X 


-08- 

WO 


■1996 


US 

w w 


5614365 


A 


25-03-1997 

L — ww X *S w / 














DE 

U I— 


29513622 


u 


19-10-1995 

X J X\J X ZJ Zf *J 
















2Q513639 

Jl JUJ7 


u 

w 


19-10-1995 

X ZJ X \J X Zf S *J 














JP 


2691149 


B 


17-12-1997 














JP 


8205874 


A 


13-08-1996 














AT 


143057 


T 


15-10-1996 














AU 


4193396 


A 


06-05-1996 














CA 

w f\ 


2201885 


A 


25-04-1996 














CN 

w 1 1 


1170439 


A 


14-01-1998 














DF 


69400567 


D 


24-10-1996 

£ — I X W X mf ^ w 














DE 


69400567 


T 


05_02-i997 

WW w t — JL S I 














DE 


655506 


T 


28-09-1995 












• 


DK 


655506 


T 


10_03-1997 














EP 


0655506 


A 


31_05-1995 

X W ± S w 














ES 


2072238 

L— \J / i— L— W 


T 


I6-07-1995 














FI 


971611 


A 


13-06-1997 














GR 


95300040 


T 


31-07-1995 

w X w / X ~r ■«-/ 














HU 


77037 


A 


02-03-1998 

W i — WW X «S «S V 














wo 


9612042 


A 


25-04-1996 














ZA 


9508761 


A 


i 3-05-1996 

X w WW X — ' w 


FP 




A 


O x 


-05- 


-1995 


US 


5614365 


A 


25-03-1997 

C— w ww 














AT 


143057 


T 


15-10-1996 

X w X w X -/ -/ w 














AU 


4193396 


A 


06-05-1996 














CA 


2201885 


A 


25-04-1996 














CN 


1170439 


A 


14-01-1998 














DE 


29513622 

^— — ' w X WW b k 


u 


19-10-1995 














DE 


29513639 


U 


19-10-1995 














DE 

Is I— 


69400567 


D 


24-10-1996 














DE 


69400567 


T 


06-02-1997 














DE 


655506 


T 


28-09-1995 














DK 


655506 


T 


10-03-1997 














EP 


0727496 


A 


21-08-1996 














ES 


2072238 


T 


16-07-1995 














FI 


971611 


A 


13-06-1997 














GR 


95300040 


T 


31-07-1995 














HU 


77037 


A 


02-03-1998 














JP 


2691149 


B 


17-12-1997 














JP 


8205874 


A 


13-08-1996 














WO 


9612042 


A 


25-04-1996 














ZA 


9508761 


A 


13-05-1996 


EP 


0745676 


A 


04- 


-12- 


-1996 


CA 


2222744 


A 


05-12-1996 














GB 


2301590 A,B 


11-12-1996 














JP 


11507211 


T 


29-06-1999 














WO 


9638568 


A 


05-12-1996 














US 


5885813 


A 


23-03-1999 


WO 


9504162 


A 


09- 


-02- 


-1995 


US 


5547859 


A 


20-08-1996 



Form PCT/ISA/210 (patent family annex) (Juty 1992) 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 

Information on patent family members 



Ini Application No 

PCT/US 00/29706 



Patent document 
cited in search report 



Publication 
date 



Patent family 



Publication 



member(s) 


date 


Al 1 

AU 


AQQ/1QQ 

oyy^yo 


D 

D 


03-12-1998 


Al 1 
AU 




A 
A 


28-02-1995 


DP 


i nni qq 
lUUloo 


A 
A 


31-12-1996 


DK 


y^u/ huo 


A 

A 


05-11-1996 


r a 

LA 


Z.LOOH/ 1 


A 

A 


09-02-1995 


LN 




A 

A 


02-10-1996 


P7 

Ll 


youuouo 


A 

A 


12-06-1996 


tr 


n7i ococ 


A 

A 


29-05-1996 


r t 
r 1 


you^oy 


A 

A 


27-03-1996 


Ul 1 

HU 


/bo41 


A 

A 


28-05-1997 




ybuyoui 


T 
1 


22-09-1997 


1 V/ 

LV 


1 14oD 


A 

A 


20-08-1996 


LV 


11486 


B 


20-02-1997 


NO 


960408 


A 


27-03-1996 


NZ 


269727 


A 


24-11-1997 


PL 


312836 


A 


13-05-1996 


SK 


14196 


A 


05-06-1996 


US 


5928919 


A 


27-07-1999 


us 


5660980 


A 


26-08-1997 



W0 9504162 



Form PCT/ISA/210 (patent family annex) (July 1992) 



page 2 of 2 



